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1.  INTRODUCTION 

This  report  summarizes  a  survey  of  the  literature  on  network  reliability  problems 
that  arise  in  modelling  communication  networks  and  includes  brief  tutorials  on  the  pre¬ 
requisite  graph-theoretic  concepts.  Emphasis  is  placed  on  mobile  communication 
systems  in  which  the  underlying  network  topology  is  subject  to  wide  variations  in  the 
local  environment,  as  opposed  to  questions  of  network  synthesis  and  design.  For  the 
latter  problems,  parts  of  the  network  configuration  (such  as  the  location  of  the 
transmitting  and  receiving  nodes  and  the  selection  of  the  communication  links)  axe  at 
least  to  some  extent  free  to  be  chosen  by  the  system  designer.  In  mobile  command  and 
control  communication  (C^)  systems  this  is  not  the  case,  as  communication  links  may 
be  limited  to  line-of-sight  paths  between  nodes  that  are  constantly  changing  position. 
Moreover,  in  a  hostile  environment  the  network  topology  is  subject  to  link  and  node 
outages  due  to  battle  damage  or  enemy  jamming. 

The  analysis  of  communication  networks  is  primarily  based  on  the  mathematical 
theory  of  graphs,  which  has  found  wide  applications  to  many  different  areas.  This  is  a 
major  branch  of  combinatorial  mathematics  that  has  been  studied  for  centuries,  and  the 
research  literature  in  the  field  is  enormous.  Recent  activity  in  graph  theory,  however,  is 
more  intensive  than  ever.  The  emphasis  is  on  the  development  of  efficient  algorithms, 
and  many  of  the  most  important  results  here  have  been  discovered  only  in  the  past 
decade.  This  aspect  of  the  theory  is  now  more  important  than  ever  because  of  the 
current  importance  of  parallel  or  concurrent  data  processing. 

The  main  aspect  'of  graph  theory  in  communications  is  the  notion  of  the 
connectivity  of  a  graph,  and  this  report  concentrates  primarily  on  this  aspect  of  the 
subject.  For  a  graph  to  be  connected  the  existence  of  a  path  from  a  given  node  to  any 
other  node  is  required,  and  this  depends  on  the  details  of  the  node-to-node  configuration 
that  describes  the  graph.  This  is  a  completely  deterministic  question  which  can  be 
quickly  answered  once  the  graph  has  been  specified  and  represented  in  a  form 
appropriate  for  processing  in  a  digital  computer.  There  are  a  number  of  efficient 
algorithms  for  determining  graph  connectivity,  generally  implemented  by  a  systematic 
traversal  of  the  graph  beginning  at  a  specified  initial  node.  More  quantitative  versions 
describe  how  the  graph  can  become  disconnected  due  to  the  loss  or  deletion  of  some  of 
its  nodes  or  edges. 
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Of  more  importance  in  modern  communication  theory  is  the  idea  of  a  probabilistic 
network,  which  is  simply  a  graph  in  which  the  individual  nodes  and/or  edges  are  subject 
to  failure.  The  general  assumption  here  is  that  the  failure  probabilities  are  known  and 
the  individual  component  failures  are  statistically  independent.  (This  second 
assumption  is  frequently  unrealistic,  but  is  necessary  to  simplify  the  mathematical 
analysis  of  the  networks.)  In  this  setting  the  underlying  (deterministic)  graph  may 
itself  be  connected,  but  the  unreliability  of  its  components  give  rise  to  failure  events  for 
which  connectivity  is  lost.  It  is  the  overall  probability  of  such  failures  that  measure  the 
reliability  of  the  network. 

Unfortunately,  the  exact  calculation  of  this  type  of  unreliability  measure  is 
generally  a  computationally  intractable  problem  and  can  be  carried  out  only  for 
networks  of  relatively  limited  size  (e.g.,  20-25  nodes).  Over  the  past  few  years  a 
number  of  approaches  to  this  problem  have  been  considered,  and  these  methods  are 
constantly  being  modified  and  improved.  The  most  important  of  these  are  described, 
along  with  their  relative  advantages  and  disadvantages,  but  they  are  all  subject  to  the 
computational  difficulty  mentioned  above.  That  is,  as  the  size  of  the  network  under 
consideration  grows,  the  demands  made  on  the  available  computational  resources  (run 
time  or  storage  capability)  become  insurmountable.  This  seems  to  be  inherent  in  the 
reliability  problem  itself,  not  a  defect  of  the  particular  algorithm  being  applied  to  its 
solution. 

While  it  is  possible  to  restrict  the  node-edge  configuration  of  the  underlying  graph 
to  such  an  extent  that  efficient  reliability  algorithms  do  become  available  for  such 
networks,  the  constraints  involved  are  .so  severe  as  to  preclude  their  use  in  modeling 
mobile  radio  networks.  Thus  future  algorithmic  improvements  may  allow  the 
computation  of  reliability  for  ever  larger  networks,  but  the  combinatorial  complexity  of 
the  problem  %vill  eventually  overwhelm  the  capability  of  any  computing  equipment 
being  used. 

For  this  reason  a  number  of  researches  have  been  devoted  to  developing  algorithms 
which  would  give  reasonable  bounds  on  network  reliability.  While  these  have  met  with 
some  success,  in  general  the  intractability  of  the  computation  is  still  present  and  the 
usefulness  of  most  of  these  methods  to  general  network  analysis  is  questionable. 
•Another  possibility  is  that  of  simulating  the  network  success/failure  events  by  Monte 
Carlo  methods,  although  these  have  slow  convergence  properties.  A  more  promising 
approach  is  the  development  of  parallel  processing  and  graph  algorithms  to  be 
implemented  on  parallel  computers.  This  area  is  quite  new  and  much  more  work 
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remains  to  be  done  here. 

There  are  many  other  graph- invariant  properties  that  are  closely  related  to 
questions  of  graph  vulnerability  or  survivability,  and  a  discussion  of  these  makes  up  the 
concluding  section.  It  is  not  clear  at  this  point  which  of  these  are  most  applicable  to 
mobile  communication  networks,  although  many  have  proved  to  be  useful  in  other 
areas.  These  quantities  are  also  the  objects  of  much  contemporary  research,  including 
the  search  for  effective  algorithms  for  their  calculation. 
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G  =  G{V,E) 

a  graph  (directed  or  undirected) 

y{G) 

set  of  nodes  (vertices)  of  G 

E{G) 

set  of  edges  (links)  of  G 

\s\ 

number  of  elements  of  a  set  5 

P 

1  V’(G)  1 ,  number  of  nodes 

Q 

j  jE’(G)  j ,  number  of  edges 

A'p 

complete  graph  on  p  vertices 

deg(u) 

degree  of  vertex  v  eV{G) 

<5(G) 

minimum  degree 

^(G) 

maximum  degree 

Diam(G) 

diameter  of  G 

dist(tf,  i’) 

hop  distance  between  nodes  u  and  v 

Ngbd(y) 

set  of  vertices  adjacent  to  node  u 

k(G) 

vertex  connectivity  of  G 

A(G) 

edge-connectivity 

c(G) 

number  of  components  of  G 

G-e 

subgraph  with  edge  e  deleted 

Gje 

subgraph  with  edge  e  contracted 

G  ~  V 

subgraph  with  node  v  deleted 

r-rl 

smallest  integer  n>  x  (ceiling) 

[jj 

largest  integer  n  <  x  (floor) 

GLOSSARY  OF  NOTATION  USED 
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2.  GRAPHS  AND  NETWORKS 

Communication  networks  are  usually  modelled  by  graphs  or  digraphs  where  the 
nodes  or  vertices  of  the  graph  are  the  transmitting/receiving  facilities  (possibly  used 
simply  as  relays),  while  the  edges  of  the  graph  indicate  the  possibility  of  communication 
between  nodes.  In  this  section  we  discuss  the  basic  graph-theoretic  concepts  of  most 
interest  and  indicate  how  these  apply  to  modelling  communication  networks.  The 
literature  in  graph  theory  is  quite  extensive  and  references  [TIO]  give  a  number  of 
recent  texts  that  introduce  the  field  at  various  levels  of  sophistication.  In  particular, 
the  book  by  Harary  [7]  has  become  a  standard  since  its  original  publication  in  1969. 

The  number  of  interesting  quantities  that  are  associated  with  a  graph  is  very  large, 
but  we  limit  ourselves  to  those  with  immediate  application  to  the  possibility  of 
communicating  within  the  graph  and  to  measuring  its  performance  and  reliability.  We 
also  indicate  how  graph  data  can  be  structured  in  a  digital  computer.  There  are  many 
ways  of  doing  this,  some  of  which  are  more  appropriate  to  mobile  communication 
networks  than  others.  Examples  of  these  representations  are  given  for  a  simple  graph. 

2.1  BASIC  DEFINITIONS 

A  graph  G  =  G(V'.  E)  consists  of  a  set  \'  =  V{G)  oi  p>  I  nodes,  or  vertices, 
together  with  a  set  £  =  E{G)  of  <31  >  0  unordered  pairs  of  distinct  nodes.  We  say  that  G 
has  order  p  and  size  q  and  refer  to  G  as  a  (p,  ql-graph.  The  elements  of  E  are  the 
edges,  or  links,  of  G.  .A.n  edge  c  ={«.  c}  is  said  to  join  u  and  v  in  G.  and  we  write  this 
as  e  =  uv.  This  definition  excludes  self- loops  (an  edge  joining  a  node  to  itself)  and 
parallel  edges  (two  or  more  edges  joining  a  pair  of  vertices). 

The  edge  uv  is  said  to  be  incident  to  the  nodes  u  and  v.  which  are  adjacent  nodes, 
or  neighbors.  Two  edges  are  said  to  be  adjacent  if  they  are  incident  to  a  common  node, 
a  node  is  isolated  if  no  edge  is  incident  to  it,  and  we  define 

Ngbd((i)  =  {?’  :  uv  e  £(G)},  (2.1) 

the  set  of  neighbors  of  node  u.  The  complete  graph  on  p  nodes,  denoted  by  has 
1  —  P(P  ~  l)/2  so  that  every  pair  of  distinct  nodes  is  adjacent. 

The  degree  of  a  node  ;■  €  V'(G).  deg(r).  is  the  number  of  edges  incident  to  it,  and  it 
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is  always  true  that 

^  deg!-.)  =  2q.  (2.2) 

V  €  VdG) 

We  also  define  the  minimum  and  maximum  degrees  as 

<^(G)4Min{deg(i;):  t;  G  V{G))  (2.3) 

A(G)4Max{de^(u):t^  €  K(G)}. 

The  graph  is  said  to  be  k-regular  if  S{G)  =  A(G)  =  k.  (2.4) 

Two  graphs  are  isomorphic  if  there  exists  a  one-one  correspondence  between  their 
node  sets  that  preserves  adjacency.  .4n  invariant  of  a  graph  (G)  is  a  quantity  associated 
with  G  which  has  the  same  value  for  all  graphs  isomorphic  to  G.  Thus  p,  q,  6,  and  A 
are  graph  invariants,  as  is  the  property  of  being  Ar-regular,  0  <  k  <  p.  Graphs  are 
generally  represented  by  point-line  diagrams-—  the  nodes  given  by  distinct  points  and 
the  presence  of  an  edge  joining  two  nodes  indicated  by  a  line  segment  or  arc  between 
them.  Figure  2.1  shows  such  diagrams  for  a  pair  of  isomorphic  graphs.  (The  node 
correspondence  is  given  by  1— +2'  ,2—+!'.  3— *3',  4— *5',  and  5— +4'.) 


4' 

FIGURE  2.1  G,  .4ND  G,  ARE  ISOMORPHIC 


.4  subgraph  of  G  is  a  graph  all  of  whose  edges  and  nodes  are  edges  and  nodes, 
respectively,  of  G.  It  is  a  spanning  suf-graph  if  it  contains  all  the  nodes  of  G.  Given  a 
subset  Uq  C  U(G)  of  nodes,  the  subgraph  Gq  of  G  generated  by  Vq  is  the  largest 
subgraph  of  G  with  node  set  Vq.  (That  is.  two  nodes  in  Uq  are  joined  in  Gq  if  and  only 
if  they  are  joined  in  G.) 

.41so.  for  any  node  set  V'q  C  U(G)  we  define  G  —  Uq  to  be  the  largest  subgraph  of 
G  which  contains  no  nodes  belonging  to  V'q-  (Note  that  it  cannot  contain  any  edges  of 
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G  which  are  incident  to  a  node  in  belongs  to  a  graph  then  the  vertices 

which  it  joins  must  belong  to  the  node  set  of  that  graph.)  Similarly,  for  a  set  of  edges 
Eq  C  E{G),  G  —  Eq  denotes  the  largest  .subgraph  of  G  which  has  no  edges  in  Eq.  Finally, 
since  any  {p.  (/)-graph  G  may  be  thought  of  as  a  subgraph  of  A'p,  we  can  consider 
enlarging  G  by  adjoining  edges — this  will  give  the  smallest  subgraph  of  A'p  which 
contains  G  and  the  adjoined  edges. 

2.2  DIRECTED  GRAPHS 

directed  graph,  or  digraph,  is  defined  exactly  as  a  graph  G  —  G{V .E)  except 
that  the  edges  E  =  E{G)  now  consist  of  ordered  pairs  of  distinct  nodes.  We  still  write 
t  =  {u.v)  e  E(G)  as  e  =  ni\  but  it  is  now  considered  as  a  link  from  u  to  v.  Its  presence 
in  EiG)  does  not  imply  that  the  opposite  link  t'  =  vu  is  also  in  E{G).  If  this  is  always 
the  case  then  we  say  that  G  is  a  symmetric  digraph,  and  asymmetric  if  it  is  never  the 
case.  -Associated  with  G  is  the  underlying  graph  G^  of  G  obtained  by  ignoring  the 
directions  of  the  edges  in  the  digraph. 

Gi  veil  a  node  v  G  V'(G)  we  define  the  indegree  of  r  to  be  the  number  of  nodes 
a  G  I  (G)  tor  which  uv  G  E{G).  Similarly,  the  outdegree  of  u  is  the  number  of  edges  in 
E{G)  which  go  from  v  to  some  other  node  a  G  I’(G).  The  deg..ee  of  v  is  the  sum  of  the 
indegree  and  outdegree.  and  we  always  have 

^  indeg(i')=  ^  outdeg(  t>)  =  <?.  (2.5) 

feV'(G)  cenc) 

The  complete  digraph  on  p  nodes  has  q  =  p{p  —  l)  edges,  one  from  each  node  to 
every  other  node.  We  will  continue  to  denote  it  by  A'p —  it  should  be  clear  from  the 
context  whether  we  are  referring  to  undirected  or  directed  graphs. 

Most  of  the  definitions  given  previously  for  undirected  graphs  go  over  to  directed 
graphs  without  change.  The  major  differences  between  the  two  types  are  in  their 
connectivity  properties,  which  will  be  discussed  later  in  this  report.  References  [8]  and 
[10]  are  devoted  exclusively  to  digraphs,  but  all  of  the  other  texts  cited  for  graph  theory 
discuss  them  to  some  extent. 

2.3  WEIGHTED  GRAPHS 

■A  weighted  graph  is  simply  a  graph  G  =  Gd'.  £)  together  with  a  weight  function 
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w.  E{G)—^U.  where  a’(e)  is  the  weight  assigned  to  the  edge  e  G  E{G).  Typically  we 
shall  assume  that  u’(e)  >  0  for  all  edges  e.  In  applications  this  might  represent  the  cost 
of  a  link  in  a  transportation  network  or  the  capacity  of  a  communications  channel  in  a 
radio  network. 

It  is  also  possible  to  consider  graphs  with  weights  assigned  to  the  nodes  v  6 
representing  the  costs  associated  with  storage  locations  in  a  transportation  network  or 
the  capacity  of  central  processing  nodes  in  a  data  network.  In  our  applications  we  shall 
generally  consider  only  functions  defined  on  the  edges  of  the  graph. 

Similarly,  the  definition  of  a  weighted  digraph  is  the  same.  The  main  point  to  note 
here  is  that  the  weight  of  an  edge  between  two  nodes  will  depend  on  the  direction  of  the 
edge.  Even  if  the  digraph  is  assumed  to  be  symmetric,  this  is  not  to  be  required  of  the 
weight  function.  That  is.  if  u,v  €  V'(G)  are  such  that  the  edges  e  =  uv  and  e'  =  vu  both 
belong  to  E{G),  it  is  not  necessarily  true  that  iv{e)  =  w{e'). 

Thus,  in  a  transportation  system  it  is  less  expensive  to  move  material  downstream 
than  to  oppose  the  current  in  the  opposite  direction.  Our  main  interest  will  eventually 
be  in  communication  networks  given  by  a  .symmetric  digraph  with  a  weight  function 
P:£(G)— +[0. 1].  Here  P{t)  represents  the  probability  that  the  communication  link 
e  G  ^(G)  is  operational,  and  this  probability  must  be  allowed  to  depend  on  the  direction 
of  the  link  in  order  to  model  such  things  as  local  jamming,  directional  antennas  for 
transmission  and/or  reception,  etc. 

2.4  GRAPH  REPRESENT.4TI0NS 

To  represent  a  graph  in  a  computer  the  data  must  be  organized  in  the  machine  so 
as  to  reflect  the  relations  among  the  nodes  V{G)  as  specified  by  the  collection  of  edges 
E{G).  One  way  to  do  this  is  to  use  the  so-called  adjacennj  matrix  .4  =  -4(G),  a  pxp 
matrix  defined  as  , 

»,,=  I' 

|0  otherwise. 

Here  the  nodes  have  been  mapped  over  to  the  set  of  integers  {1, _ p).  (That  is.  the  z-th 

node  read  into  the  machine  is  assigned  to  the  integer  i.)  This  matrix  has  zeros  on  the 
main  diagonal  (no  self-loops)  and  is  symmetric  if  G  is  a  graph  or  a  symmetric  digraph. 
(That  is.  —  for  all  -  so  only  ^/(7— l)/2  matrix  entries  need  to  be 

stored. ) 
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Other  variations  of  this  technique  are  possible.  If  it  is  desired  to  keep  track  of  the 
edges,  then  the  ij-entry  of  .4  would  be  the  number  of  the  edge  e  =  ij  in  the  order  in 
which  it  appeared  in  the  input.  Also,  for  a  weighted  graph  the  matrix  could  contain  the 
weights  of  the  corresponding  edges.  Graph-processing  algorithms  are  discussed  in  more 
detail  below  and  in  references  [11-28],  but  it  is  important  to  note  that  the  way  a  graph 
is  represented  in  the  computer  can  have  a  large  effect  on  the  efficiency  of  an  algorithm. 

For  example,  one  problem  with  the  adjacency  matrix  representation  is  that  if  there 
are  relatively  few  edges  in  the  graph  (as  compared  with  the  complete  graph  A'p),  then 
most  of  the  entries  in  A  will  be  zero,  thereby  using  a  lot  of  unnecessary  computer 
storage.  In  this  case  one  would  prefer  to  use  the  so-called  adjacency  list  representation, 
where  each  vertex  i  is  associated  with  a  linked  list  containing  the  vertices  which  are 
adjacent  to  it,  (For  a  digraph  we  would  maintain  two  sets  of  lists,  one  for  nodes 
adjacent  to  an  edge  from  i  and  the  other  for  nodes  adjacent  to  an  edge  going  to  i.)  For 
dense  matrices,  however,  where  the  number  of  edges  is  close  to  that  for  A'p,  the  matrix 
representation  is  to  be  preferred. 

Other  considerations  are  also  important.  If  we  consider  G  as  a  subgraph  of  A'p  and 
enlarge  it  by  adjoining  edges,  or  decrease  it  by  removing  some  edges  and/or  nodes,  it  is 
much  easier  to  reflect  such  changes  in  the  adjacency  matrix  than  to  update  the  linked 
lists  of  vertex  connections.  However,  the  matrix  representation  can  be  costly  in 
processing  the  graph  data.  In  many  cases  we  are  given  a  vertex  v  6  F(G)  and  are  faced 
with  the  problem  of  selecting  some  subset  of  Ngbd(t^),  the  neighboring  nodes.  The 
adjacency  lists  are  clearly  superior  for  such  a  purpose,  whereas  in  the  adjacency  matrix 
we  would  have  to  scan  across  a  whole  row  (or  column)  of  ,4  which  may  consist  almost 
entirely  of  zeros. 

Nevertheless,  we  shall  prefer  to  use  the  matrix  representation.  The  main  reason  for 
this  is  our  interest  in  modelling  communication  networks  in  widely  varying 
environments.  Under  benign  conditions  with  most  nodes  within  line-of-sight  of  one 
another  and  with  no  hostile  activity,  almost  all  of  the  links  will  be  operational  and  the 
linked  list  adjacencies  have  no  storage  advantage,  but  will  exact  a  higher  cost  in 
overhead.  Under  jamming  conditions  and  more  stressful  environments  some  links  may 
drop  out  or  be  disabled,  but  the  adjacency  matrix  is  more  easily  updated  to  take 
account  of  such  changes.  The  penalty  for  use  of  this  representation  is  that  one  must  use 
graph  processing  algorithms  which  are  inefficient  in  dealing  with  sparse  graphs.  These 
various  representations  are  shown  for  an  example  graph  in  Figure  2.2. 
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Example  Graph: 


8  Vertices 

10  Ed  es  (vs  28  edges  in  the  complete  graph) 
Edge  Listing: 


1  2 
1  4 

1  5 

2  3 
2  6 

3  4 
3  7 
5  6 
5  8 
7  8 

Adjacency  Matrix: 


0  10  110  0  0 
10  10  0  10  0 
0  10  10  0  10 
1  0  1  0  0  0  0  0 

1  0  0  0  0  1  0  1 

0  1  0  0  1  0  0  0 

0  0  1  0  0  0  0  1 

0  0  0  0  1  0  1  0 

Adjacency  Lists: 

Node  Ngbd(Node) 

1 
2 

3 

4 

5 

6 

7 

8 


FIGURE  2.2  GRAPH  REPRESENTATIONS 


2  4  5 

1  3  6 

2  4  7 

1  3 

1  6  8 

2  5 

3  8 
5  7 
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3.  GRAPH  CONNECTIVITY 

Reliability  in  communication  networks  is  a  measure  of  the  ability  of  nodes  of  the 
network  to  establish  communication  with  one  another,  either  directly  or  through  relay 
facilities.  In  particular,  a  communication  path  must  exist  between  any  two  nodes  of  the 
network,  for  otherwise  one  part  of  the  network  would  be  isolated  from  another.  This  is 
determined  by  the  connectivity  of  the  underlying  graph  or  digraph  used  to  model  the 
network.  Moreover,  we  would  like  to  have  a  large  number  of  alternative  paths  between 
nodes  so  that  the  failure  of  a  few  network  components  would  not  result  in  such  isolated 
nodes. 

This  section  defines  the  basic  notions  of  connectivity  in  graphs  and  digraphs  and 
discusses  some  graph  invariants  to  quantify  network  connectivity  and  vulnerability. 
Network  reliability  is  a  difficult  problem  because  it  is  concerned  with  the  overall  global 
performance  of  the  network,  while  the  underlying  graph  is  defined  explicitly  only  in 
terms  of  the  local  interconnections  among  a  large  number  of  components.  Connectivity 
is  an  extremely  important  property  of  a  graph  and  any  introduction  to  graphs  will 
include  some  discussion  of  it.  For  its  relationship  to  networks  see  the  text  by  Colbourn 
[14],  which  is  the  best  available  reference  to  this  subject. 

3.1  COMPONENTS  OF  A  GRAPH 

A  walk  in  a  graph  G  is  an  alternating  sequence  of  nodes  and  edges 

Cq,  Cl,  f’l,  62,  ....  e„,  r„  (3.1) 

where  =  ffc-i  for  I  <k  <  n.  It  is  said  to  connect  Vq  with  v„  and  has  length  n.  A 
walk  is  closed  if  Vq  =  otherwise  open.  It  is  called  a  trail  if  its  edges  are  distinct,  and 
a  path  if  its  nodes  (hence  also  its  edges)  are  distinct.  A  circuit  is  a  closed  walk  and  a 
cycle  is  a  circuit  with  distinct  nodes  (aside  from  the  initial  node  Vq  =  e„).  It  is  generally 
sufficient  to  denote  a  walk  by  its  node  sequence,  the  edges  being  implicit  in  the  nodes. 

Two  vertices  u,v  G.  V{G)  are  said  to  be  connected  in  G  if  there  is  a  path  from  v  to  v 
in  G.  and  G  is  said  to  be  a  connected  graph  if  every  pair  of  nodes  is  so  connected.  By 
convention  each  v  G  V'(0  is  connected  to  itself,  so  the  property  of  being  connected  in  G 
is  an  equivalence  relation  on  1(G).  This  relation  therefore  induces  a  partition  of  V'(G) 

into  a  number  of  disjoint  subsets  I’l . 1\.  whereby  two  nodes  are  connected  in  G  if  and 

only  if  they  belong  to  the  same  subset  of  this  partition.  The  number  k  of  sets  in  the 
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partition  is  a  graph  invariant  which  will  be  denoted  by  c{G),  so  G  is  connected  if  and 
only  if  c{G)  =  1. 

The  subgraphs  Gj,..., G*.  of  G  generated  by  the  Kj,..., Fjt  are  referred  to  as  the 
components  of  G.  and  these  are  the  maximal  connected  subgraphs  of  G.  These  form  a 
disjoint  partition  of  G  (all  the  nodes  and  edges  of  G  are  accounted  for),  and  in  many 
cases  one  can  restrict  one’s  attention  to  connected  graphs  only  —  otherwise  consider  its 
components  in  turn.  The  graph  G  is  said  to  be  disconnected  if  c(G)  >  1  and  totally 
disconnected  if  c(G)  =  p,  whereby  every  v  G  V{G)  is  isolated  in  G. 

If  u  and  V  belong  to  the  same  component  of  G,  then  we  define  the  distance 
dist(u,i.’)  between  them  as  the  length  of  the  shortest  path  joining  them.  (We  also  set 
dist(u,u)  =0,  and  dist(n, e)  =  +  oo  if  and  v  belong  to  different  components  of  G.) 
This  induces  a  metric  on  each  component  of  G,  that  is,  it  is  symmetric  and  satisfies  the 
triangle  inequality 

dist(u,  u)  =  dist(i;,  u)  (3.2a) 

dist(u,u;)  <  dist(u,u)  +  dist(v,u;).  (3.2b) 

A  graph  which  is  used  to  model  a  communications  network  is  generally  assumed  to 
be  initially  connected.  (Otherwise  we  study  each  component  as  a  network  by  itself.)  In 
this  setting  the  metric  defined  above  is  usually  referred  to  as  the  hop  distance.  If 
(t>o,  Ui,. . .,  n„)  is  a  path  in  G,  we  can  imagine  sending  a  message  from  node  Vq  to  node 
through  the  n  —  1  relay  nodes  A  total  of  n  trcinsmissions  would  be  required 

by  this  path.  Thus  the  hop  distance  dist(u, n)  is  the  minimum  number  of  transmissions 
required  to  realize  communication  between  nodes  u  and  v.  Nodes  in  different 

components  of  the  network  cannot  be  so  joined,  and  such  communication  is  not 
possible. 

3.2  MEASURES  OF  CONNECTIVITY 

Obviously  connectivity  of  a  graph  G  is  desirable  for  communications,  so  one  would 
want  to  quantify  this  property  and  determine  how  “well-connected”  it  is.  The  most 
common  way  to  do  this  is  by  specifying  the  number  of  nodes  or  edges  that  must  be 
removed  in  order  to  disconnect  G.  For  even  more  information  we  would  want  to 
identify  those  particular  nodes  or  edges  that  should  be  removed  in  order  to  do  this,  that 
is.  those  to  which  the  network  is  most  vulnerable. 

Thus,  we  say  that  v  6  V^(G)  is  a  separating  node  (or  cut-node)  of  G  if 
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c{G  —  V)  >  c((?).  Similarly,  e  €  E{G)  is  a  separating  edge  of  c{G  —  e)  >  c{G).  A  non- 
separable  or  biconnected  graph  is  one  that  is  connected  and  has  no  separating  node,  and 
a  block  of  G  is  a  maximal  non-separable  subgraph.  Note  that  the  blocks  of  a  graph  G 
are  not  necessarily  disjoint  so  as  to  form  a  partition  of  V{G),  but  two  blocks  can  have  at 
most  one  separating  node  of  G  in  common.  This  type  of  connectivity  can  be 
characterized  as  follows — not  only  can  any  pair  of  vertices  be  joined,  but  this  can  be 
done  through  any  specified  edge. 

More  generally,  we  will  say  that  a  connected  graph  G  is  n-connected,  n  >  1,  if  it 
remains  connected  after  the  removal  of  any  set  of  n  —  1  vertices.  The  node-connectivity 
k{G)  is  the  smallest  number  of  vertices  whose  removal  results  in  a  disconnected  graph. 
In  a  similar  way,  we  define  an  n-edge-connected  graph  and  the  edge-connectivity  A(G). 
It  is  always  true  that 

k{G)  <\{G)  <S{G).  (3.3) 

and  this  result  is  best-possible.  (That  is.  given  any  three  integers  a,6,c  for  which 
I  <  a  <  b  <  c,  there  exists  a  connected  graph  G  for  which  k{G)  =  a,  A(G)  =  b,  and 
8{G)  =  c.)  The  book  of  Capobianco  and  Molluzzo  [4]  is  a  good  reference  for  these 
inequalities  and  illustrates  by  example  the  fact  that  they  cannot  be  improved. 

For  communication  networks  it  is  desirable  to  have  large  values  of  k(G)  and  A(G), 
as  this  provides  for  many  alternative  paths  for  communication  between  nodes. 
Conversely,  small  values  of  these  graph  invariants  imply  that  the  network  is  relatively 
vulnerable  and  can  be  disabled  (i.e..  disconnected)  by  the  removal  of  only  a  few  of  its 
nodes  or  links. 

One  should  note,  however,  that  in  general  a  graph  is  more  susceptible  to  node 
removals  than  to  a  loss  of  edges.  For  example,  the  loss  of  the  central  node  in  a  star 
network  will  result  in  the  remaining  node  set  being  completely  disconnected,  but  for  an 
edge  removal  this  type  of  behavior  is  not  possible.  That  is.  if  k  =  A(G)  >  2.  then  no 
possible  loss  of  k  edges  can  result  in  a  graph  with  more  than  two  components. 
Moreover,  the  removal  of  an  edge  (as  opposed  to  that  of  a  vertex)  cannot  decrease  the 
number  of  components  of  a  disconnected  graph,  and  in  general 

k{G  -  v)  >  k{G)  -1  for  all  t’ G  VhG)  (3.4a) 

A(G)>A(G-c)>A(G)-l  for  all  e  G  £(G).  (3.4b) 
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3.3  CLASSICAL  RESULTS  ON  CONNECTIVITY 

In  general,  for  a  given  number  p  of  nodes,  a  {p,q)-gTa.ph.  will  have  better 
connectivity  for  larger  q.  Many  of  the  standard  results  in  graph  theory  illustrate  this 
point,  such  as 

(i)  U  q  <  p  —  1,  then  G  is  disconnected  (3.5a) 

(ii)  If  7  >  (p  —  1)  (p  —  2)/2,  then  G  is  connected.  (3.5b) 

These  statements,  so  disparate  as  to  be  essentially  useless,  use  only  the  global 
invariants  p,q  of  the  graph.  More  precise  results  can  be  stated  in  terms  of  the  local 
structure  of  the  graph  at  its  nodes.  Specifically  [4], 


(iii) 

If  S(G)  >  p/2,  then  A(G)  =  S(G). 

(3.5c) 

(iv) 

If  <5(G)  >  p  -  2.  then  k(G)  =  S(G) 

(3.5d) 

(v) 

If  S(G}  >  (p  +  n  —  2)/2,  where  1  <  n  <  p, 
then  G  is  n-connected  (i.e.,  k(G)  >  n.) 

(3.5e) 

The  most  important  results  along  these  lines  are  the  theorems  of  Menger  and  their 
many  variations  [3,  7]. 

Menger 's  Theorems  (Node/Edge  versions) 

(i)  t((G)  >  n  if  and  only  if  for  each  pair  u.vE  V(G)  of  distinct,  non- 

adjacent  nodes  there  exist  at  least  n  node-disjoint  paths  which 
connect  u  and  v. 

(ii)  A(G)  >  n  if  and  only  if  for  each  pair  u.  u  E  V(G)  of  distinct  nodes 

there  exist  at  least  n  edge-disjoint  paths  which  connect  u  and 

V. 

Observe  the  difference  in  the  types  of  u-v  connections  referred  to  here.  Two  edge- 
disjoint  paths  between  u  and  v  need  not  be  node-disjoint,  for  different  edges  may  be 
incident  to  a  common  node.  Node-disjoint  paths,  however,  are  necessarily  also  edge- 
disjoint.  Thus  there  should  be  fewer  node-disjoint  paths  than  edge-disjoint  paths 
between  any  pair  of  vertices,  indicating  once  more  the  greater  sensitivity  of  graph 
connectivity  to  node  removal  than  to  edge  removal. 

3.4  CONNECTIVITY  IN  DIRECTED  GRAPHS 

Most  of  our  definitions  for  (undirected)  graphs  transfer  directly  to  digraphs,  except 
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what  was  previously  called  a  walk  (resp.  path)  is  now  referred  to  as  a  directed  walk 
(resp.  path)  from  one  node  to  another.  The  major  difference  is  in  the  concept  of 
connectivity,  which  has  several  interpretations  when  the  edge  directions  are 
accounted  for.  We  shall  say  that  v  €  V''((j)  is  reachable  from  u  6  V{G)  if  there  is  a 
directed  path  in  G  from  u  to  v.  This  is  not  an  equivalence  relation,  however,  since 
there  may  not  be  a  directed  path  from  v  back  to  u.  We  can  talk  about  the 
‘‘components'’  of  G  with  respect  to  this  relation  (i.e.,  maximal  subgraphs),  but  they 
do  not  form  a  partition  of  the  graph  into  disjoint  pieces  [8],  as  illustrated  in  Figure 
3.1.  Here  node  3  is  reachable  from  both  node  1  and  node  4.  but  neither  one  of  these 
two  is  reachable  from  the  other. 
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The  problem  with  this  type  of  connectivity  is  that  is  does  not  result  in  a 
partition  of  the  graph  into  disjoint  subgraphs  which  can  be  analyzed  separately.  To 
avoid  this  difficulty  we  define  two  other  types  of  connectedness.  We  shall  say  that 
u.v  E  V{G)  are  weakly  connected  if  they  axe  connected  in  the  underlying  graph  G^  of 
G.  This  is  simply  the  type  of  connectivity  considered  previously.  We  shall  say  that 
u,v  £  I''^(G)  are  strongly  connected  if  each  is  reachable  from  the  other.  .A^s  this  gives  a 
true  equivalence  relation,  we  can  talk  about  strongly  (resp.  weakly)  connected 
digraphs  and  refer  to  the  strong  (resp.  weak)  components  of  digraphs.. 

There  is  still  a  problem  here,  illustrated  by  the  first  digraph  shown  in  Figure  3.2 — 
this  digraph  is  weakly  connected,  but  in  the  strong  sense  it  is  totally  disconnected. 
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That  is,  each  strong  component  is  an  isolated  vertex,  for  in  this  digraph  no  vertex  is 
strongly  connected  to  any  other. 


Nodes  u  and  v  Each  node  reachable 

not  reachable  from  w  from  the  other  two 

FIGURE  3.2  STRONGLY  DISCONNECTED  AND  CONNECTED 


Another  type  of  problem  is  also  shown  in  Figure  3.2,  where  the  reversal  of  the 
direction  of  a  single  edge  completely  disconnects  a  strongly  connected  graph.  We 
shall  avoid  these  difficulties  in  what  follows  by  usually  assuming  a  symmetric  digraph 
when  modeling  a  communications  network.  In  this  case  the  reversal  of  any  edge  is 
still  an  edge,  so  the  reversal  of  any  path  in  G  is  again  a  path  in  G.  For  such  graphs 
reachability  is  a  symmetric  relation  and  the  notions  of  strong  and  weak  connectivity 
coincide  so  that  all  of  the  previous  discussion  applies  with  only  minor  modifications. 

This  is  admittedly  a  very  strong  assumption  and  would  not  ordinarily  be  allowed, 
for  in  many  of  the  applications  of  digraphs  the  edge  directions  are  of  criticeil 
importance  to  the  problems  being  modelled.  In  scheduling  problems,  for  example,  it 
is  clearly  impossible  to  execute  one  task  before  doing  all  those  others  which  must  feed 
into  it.  We  can  justify  our  cissumption  here  only  because  we  will  eventually  limit 
ourselves  to  stochastic  networks  in  which  each  edge  is  assigned  a  probability  of  its 
being  operational.  From  the  probabilistic  point  of  view  then,  the  presence  of  any 
particular  edge  in  a  digraph  is  not  important  since  it  can  be  effectively  eliminated 
from  consideration  by  assigning  it  an  operational  probability  of  zero. 

3.5  TREES 

The  most  important  type  of  connected  graph  is  a  tree,  ncimely  a  (p,5)-graph  T 
which  is  connected  and  for  which  q  =  p—1.  Such  a  graph  is  necessaxily  circuit-free 
and  will  be  disconnected  by  the  removal  of  any  edge.  Moreover,  between  any  two 
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nodes  of  T  there  is  a  unique  path  joining  them.  .A.  subgraph  T  of  a  connected  graph 
G  is  a  spanning  tree  if  it  is  a  spanning  stibgraph  of  G  which  is  also  a  tree.  The 
addition  to  T  of  any  node  or  edge  of  G  will  then  create  a  cycle  in  T.  .A  node 
V  G  y{T)  is  pendant  if  deg(r)  =  1.  while  an  edge  e  £  E{T)  is  a  leaf'd  it  is  incident  to 
a  pendant  node. 

If  G  is  a  weighted  graph  we  say  that  a  spanning  tree  T  of  G  is  minimal  if 

w(T)  =  Me)  (3.6) 

e6  £(r) 

is  a  minimum  over  all  such  spanning  trees.  Such  spanning  trees  always  e.xist.  and  can 
be  constructed  by  means  of  the  following  [IS]. 

Theorem:  Let  Cq  G  r(G)  and  let  Cq  ^  E(G)  be  such  that  u.>(eo)  <  w{e)  for  all  edges  e 
incident  to  Cq.  Then  there  is  a  minimal  spanning  tree  of  G  which  contains  the  edge 
Co- 

This  result  is  the  basis  for  algorithms  to  construct  a  minimal  spanning  tree  for  any 
connected,  weighted  graph  G  starting  from  a  given  initial  root  node  Vq.  One  well- 
known  method  is  Prim's  method  [20] .  which  may  be  described  as  follows: 

1.  Sei\\={v^].T  =  0. 

2.  If  V^o  =  V'(G)  then  stop 

3.  Let  £o  =  (e  e  E{G):  e  =  ur,  u  G  Vq,  v  €  r(G)  -  V^} 

L  =  min  {te(e):  e  G  Eq} 
and  choose  e  ^  Eq  with  ic(eo)  =  L.  say  e  =  uv 

4.  Letro  =  CoU{r} 

T  =ru(e} 

Go  to  2 

.At  any  stage  we  adjoin  to  T  the  least  expensive  (i.e.,  smallest  weight)  edge  which 
maintains  the  tree  connectivity  without  forming  a  circuit.  If  we  do  not  insist  on 
keeping  T  connected  we  obtain  Kruksal's  method,  which  builds  up  a  collection  of 
disjoint  trees  which  gradually  merge  into  one  another  to  exhaust  F(G).  .Also,  for  a 
disconnected  graph  this  algorithm  can  be  used  to  find  the  components  of  G,  if  in  step 
3  we  choose  e  G  £0  arbitrarily,  but  stop  as  soon  as  Eq  =  O.  The  set  I'q  is  then  the  set 
of  nodes  in  the  component  of  G  which  contains  Vq.  (The  same  procedure  finds  a 
spanning  tree  in  an  unweighted  graph — simply  define  a  weight  function  as  ir(e)  =  1. 

eG  £(G).) 
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Prim's  method  is  an  example  of  a  greedy  algorithm  [21],  which  always  chooses  the 
l)est  edge  available  at  that  time.  That  is.  it  makes  a  locally  optimal  selection,  which 
for  this  example  does  in  fact  result  in  a  global  optimum,  a  minimal  spanning  tree. 
Greedy  methods  do  not  always  yield  optimal  solutions,  but  for  many  problems  in 
graph  theory  they  do. 

If  r  is  a  digraph  we  generally  refer  to  a  rooted  tree,  which  has  a  distinguished  node 
t'o  €  I  (T).  the  root,  such  that 

(i)  indeg(ro)=0  (3.7a) 

(ii)  indeg(t')  =  1  for  all  r  €  V{T)  —  Vq,  (3.7b) 

(iii)  Every  r  6  \  'iT)  is  reachable  from  Vq.  (3.7c) 

When  we  speak  of  a  subtree  or  spanning  tree  of  a  digraph  G  we  implicitly  mean  a 

stibgraph  of  this  type.  Most  of  the  above  definitions  make  sense  in  this  context  and 
the  graph  algorithms  used  will  be  applied  here.  Note  however,  that  Prim's  algorithm 
gives  a  minimal  spanning  tree  rooted  at  Vq.  This  is  not  necessarily  a  minimal 
spanning  tree  of  G.  for  it  is  not  known  a  ■priori  how  the  root  should  be  chosen  so  as 
to  give  the  desired  global  minimum. 
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4.  GRAPH  ALGORITHMS 

Many  problems  are  naturally  formulated  in  terms  of  graphs  or  digraphs,  such  as 
transportation  or  communication  networks,  job  scheduling  in  manufacturing,  physical 
connectivity  in  electrical  circuits,  etc.  Obvious  questions  arise,  such  as  which  nodes  are 
connected  to  a  given  node,  what  path  between  two  nodes  is  the  least  expensive,  or  when 
each  manufacturing  task  should  be  scheduled. 

For  probabilistic  graphs  we  don't  ask  if  the  graph  is  connected,  but  what  the 
probability  of  that  event  may  be.  or  we  might  w'ant  the  average  distance  (expected 
value)  between  two  vertices.  \Ve  consider  here  some  deterministic  problems  which 
process  only  the  graph  data  ViG)  and  E{G).  along  with  any  edge  costs  if  vve  have  a 
weighted  graph.  Problems  involving  probabilistic  networks  will  be  considered  below  in 
Section  5,  but  there  is  a  significant  difference  here.  As  we  shall  see  below,  it  is  very 
easy  to  determine  if  a  graph  is  connected,  but  very  difficult  to  compute  the  probability 
that  this  is  the  case. 

Before  answering  these  probabilistic  cpiestions  we  must  first  deal  directly  with  the 
nodes  and  links  of  the  graph  that  represents  the  network.  For  example,  the  probability 
of  two  nodes  being  connected  depends  on  the  existence  of  paths  between  them,  so  w’e 
would  need  to  have  an  algorithm  which  returned  a  list  of  all  such  paths,  if  any.  The 
area  of  graph  algorithms  is  a  very  important  part  of  graph  theory,  particularly  for 
reliability  problems  which  are  computationally  rather  expensive.  We  are  interested  in 
(’fficient  algorithms,  where  efficiency  is  judged  by  being  able  to  run  the  algorithm  to 
completion  in  an  acceptably  short  time.  Questions  regarding  available  storage  or 
internal/e.xternal  memory  requirements  are  not  considered. 

4.1  ALGORITHM  COMPLEXITY 

The  complexity  of  an  algorithm  is  generally  measured  by  the  runtime  of  its  imple- 
mentatioi;  on  a  digital  computer  versus  the  size  of  the  input  data  used  by  the 
algorithm.  It  is  important  to  recognize  that  this  is  a  measure  of  the  algorithm's  worst- 
case  p<Tformance.  That  is.  we  seek  an  upper  hound  f{u)  on  the  maximum  number  of 
elementarv  o[)erations  to  carry  out  tlie  algorithm  as  a  function  of  the  size  n  of  the  input 
<lata.  It  may  happen  that  some  inputs  are  processed  very  quickly,  or  that  the  average 
performance  is  fast,  but  we  want  a  guaranteed  upper  bound  for  the  computation  over 
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all  allowable  inputs.  References  [29-38]  are  all  good  references  to  the  general  theory  of 
algorithms  and  algorithmic  complexity.  Of  these  the  book  of  Harel  [33]  is  especially 
recommended  as  a  very  readable  and  wide-ranging  view  of  the  field,  and  Sedgewick  [37] 
has  a  good  discussion  of  algorithm  implementation. 

In  our  case  the  input  data  is  a  graph  or  digraph  G[V,  E)  consisting  of  p  vertices 
and  q  edges,  so  the  complexity  of  an  algorithm  will  generally  be  estimated  by  a  function 
of  p  and/or  q.  We  shall  use  the  0-notation  here — given  two  functions  f{n)  and  gin),  to 
say  that  f{n)—0[g(n)]  if  there  is  a  constant  M  >  0  so  that  f{n)<Mg{n)  for  all 
sufficiently  large  n.  The  meaning  of  this  is  that  /(n)  is  no  worse  that  g{n),  up  to  a 
multiplicative  constant.  .An  algorithm  is  said  to  run  in  polynomial  time  if  its 
comple.xity  satisfies  f{n)=0(n^)  for  some  k>0.  Special  cases  are  constant  time 
{k  =  0)  and  linear  time  (k  =  1).  .As  an  example,  the  well-known  quicksort  algorithm  to 
.sort  a  collection  of  n  objects  is  of  type  O(n^),  although  its  average  performance  is  known 
to  be  0(n  log  n).  much  faster  for  large  n. 

Polynomial  time  is  generally  taken  as  the  boundary  between  practical  and 
intractable  problems.  Those  problems  for  which  there  are  no  known  polynomial-time 
algorithms  must  generally  be  solved  using  algorithms  which  exhibit  exponential  growth 
in  the  input  size  and  are  used  only  for  relatively  small  networks.  For  faster 
computation  one  must  settle  for  only  an  approximate  solution,  or  restrict  the  input  data 
to  special  networks  which  do  admit  a  polymonial-time  solution. 

4.2  NP-HARD  AND  NP-COMPLETE  PROBLEMS 

Of  special  interest  here  are  the  so-called  NP  (non-deterministic  polynomial) 
problems.  Roughly  speaking,  these  are  problems  for  which  there  exists  a  polynomial¬ 
time  algorithm  which  can  verify  the  validity  of  any  proposed  candidate  solution  to  the 
problem,  although  the  problem  itself  may  not  be  known  to  have  a  polynomial  time 
solution.  .A  problem  is  called  .VP-hard  if  it  is  at  lecist  cis  difficult  as  any  .VP-problem 
(that  is.  a  polynomial-time  solution  for  that  problem  would  imply  such  a  solution  to 
rvery  NP  problem).  .An  .VP-hard  problem  is  said  to  be  .VP-complete  if  it  is  itself  an 
.VP  problem. 

It  is  not  known  at  this  time  if  there  exists  any  .VP-problem  which  does  not  have  a 
polynomial- time  solution.  This  would  require  a  lower  bound  on  such  a  problem's 
complexity  which  grew  faster  than  any  polynomial  function.  The  book  of  Garey  and 
•Johnson  [32]  is  the  most  complete  introduction  to  .VP-completeness.  although  its 
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compilatiou  of  .VP-complete  problems  is  already  quite  dated. 

Unfortunatel}-.  mane  u-ell-kno\vn  j)roblem.s  in  graph  theory  ha\’e  been  shown  to  be 
P-hard.  including  some  that  are  related  to  network  reliability  [40,  41].  .A.s  an 
example,  we  consider  the  following  two  questions  for  a  connected  graph  G. 

(i)  an  Eideruin  tour  of  G  is  a  circuit  of  G  in  which  each  edge  occurs  exactly  once. 

(ii)  a  Hantilton/au  tour  of  G  is  a  spanning  cycle  of  G.  (This  is  closely  related  to  the 
well-known  traveling  salesman  problem.  ] 

The  question  is  whether  such  tours  exist,  but  the  answers  are  cpiite  different.  The 
Eulerian  tour  problem  is  solvable  by  an  Oip~)  algorithm,  while  the  Hamiltonian  tour 
problem  is  .VP-complete,  and  as  such  the  ([uestion  is  essentially  unanswerable  for  large 
networks.  The  inefficiency  of  a  deterministic  algorithm  for  an  .VP-complete  problem  is 
due  to  the  exponentially  large  number  of  possible  solutions  that  must  be  dealt  with,  and 
it  may  be  possible  to  decrease  this  expense  by  better  defining  and  limiting  the  number 
and  type  of  potemtial  candidates  in  the  search.  While  still  resulting  in  e.xponential 
complexity,  the  order  of  growth  is  decreased  and  larger  graphs  can  be  processed  in  a 
prescribed  amount  of  time.  Because  of  its  practical  importance,  the  Hamiltonian  tour 
problem  has  been  more  intensively  investigated  than  any  other  .VP-complete  problem 
[3.  13].  For  large  graphs,  however,  ail  of  the  available  search  methods  are  intractable, 
and  one  must  settle  for  approximate  solutions  [21]. 

It  shoidd  be  noted,  however,  that  some  problems  in  the  theory  of  graphs  are 
intrinsically  non-polynomial  in  nature.  For  example,  the  number  of  minimal  spanning 
trees  in  a  completely  connected  grajdi  on  p  vertices  is  p^~^.  Thus  any  algorithm  to 
compute  all  the  spanning  trees  of  a  graph  must  expect  to  work  at  least  this  hard,  which 
is  much  worse  than  any  exponential  growth.  Similarly  the  number  of  minimal  cutsets 
and  minimal  paths  in  graphs  also  exhibit  such  explosive  growth  and  cannot  be 
completely  specified  by  any  polynomial-time  algorithm.  No  A^P-problem  is  known  to 
be  of  this  type. 

4.3  GR.APH  TR.4\'ERS.4LS 

.-k  travfi'.sal  of  a  connected  graph  is  simply  a  procedure  to  visit  the  nodes  of  G  in 
'^ome  systematic  way,  starting  from  .some  given  node  and  moving  to  adjacent  nodes  via 
the  I'dges  of  the  graph.  The  two  most  common  methods  for  doing  this  are  the  depth- 
first  search  and  breadth- first  search  [20.  21].  Both  can  be  used  to  construct  spanning 
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trees  of  G  and  to  determine  the  component  of  the  graph  that  contains  the  initial  root 
node. 

In  a  depth-first  search,  immediately  on  visiting  one  node  we  move  on  to  visit  one  of 
its  imvisited  neighbors,  if  any.  If  this  is  not  possible  we  backtrack  to  the  last  node  that 
was  visited  which  does  have  unvisited  neighbors  and  repeat  the  process.  The  procedure 
terminates  only  when  the  visited  nodes  have  no  unvisited  neighbors,  exhausting  V{G)  if 
that  graph  was  connected.  This  very  simple  procedure  is  inherently  recursive  and  is 
best  implemented  in  this  way: 

Depth-First  Search 

1.  Given  root  node  Uq,  set  T  =  0 

2.  Set  c(u)  =  0  for  all  n  6  I/(G) 

3.  Visit  (t’o) 

4.  Return  T 

The  initially  empty  edge  set  T  will  be  a  tree  which  is  rooted  at  Vq  and  spans  the 
component  of  G  containing  I’o-  Hence,  for  a  connected  graph,  T  is  a  spanning  tree  of  G. 
It  is  constructed  by  successive  calls  to  the  recursive  function  Visit,  that  is 

Visit{v) 

c{v)  =  1 

For  all  u  G  N gbd{v)  do 
If  c{u)  =  0  then 
Visit  (u) 

T  =  T  U  {nn} 


In  this  function,  each  call  to  the  function  immediately  marks  the  current  vertex  v  as 
having  been  visited  (c(v)  =  1),  but  no  edges  are  added  to  the  tree  T  as  long  as  that 
vertex  .still  has  neighboring  vertices  which  have  not  yet  been  so  maxked.  Instead,  it 
moves  to  one  of  the  unmarked  neighbors  and  repeats  the  process.  The  function  call 
finally  returns  with  an  edge  added  to  T  only  when  it  ha.s  reached  and  marked  as  visited 
a  vertex  all  of  whose  neighbors  have  already  been  visited.  Thus  the  search  is  forced  to 
look  for  new  nodes  to  visit  which  are  farther  away  from  the  root,  and  it  will  move 
backward  towards  the  root  only  when  it  encounters  a  dead  end  (although  it  can  be 
expected  to  form  very  long  paths  which  will  loop  back  closer  to  the  root).  It  tends  to 
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produce  very  long  trees  with  few  branches,  as  the  last  vertex  processed  yields  the  first 
edge  of  T. 

Breadth-first  search,  on  the  other  hand,  visits  all  neighboring  vertices  of  a  given 
vertex  before  moving  farther  away  from  the  root,  and  as  such  it  tends  to  produce  short, 
fat  trees  with  many  branches.  It  can  also  be  given  in  recursive  form,  but  is  perhaps 
more  naturally  non-recursive.  One  version  of  breadth-first  search  is  described  cis 
follows: 

Breadth-First  Search 


1.  Set  c(u)  =  0  for  all  v  €  V{G) 
i  =  l 
c(i>o)  =  1 


2. 


Let  5  =  (e:  c(i’)  =  0.  u  G  Ngbd{v)  for  some  u  with  c(ii)  =  i} 
If  5  =  0  then  stop 


3. 


i  —  ?  "h  1 

c(u)  =  i  for  all  i>  G  S 


4.  Go  to  2 


This  implementation  of  the  breadth-first-search  algorithm  includes  a  vertex-labelling 
scheme  which  indicates  how  far  the  search  has  proceeded  when  it  is  visiting  any 
particular  vertex.  (That  is.  it  computes  c(i;)  =  1 -f  dist(i;,  Uq)  for  all  vertices  vG:V{G) 
which  are  connected  to  the  root  vertex  Vq.)  For  any  given  distance  d  from  the  root,  the 
search  will  visit  all  nodes  at  distance  d  before  moving  out  to  look  for  nodes  connected  to 
^0  at  a  distance  of  d  -f  1.  Edges  closer  to  the  root  are  considered  earlier  in  the  process. 

These  procedures  have  many  extensions  and  variations.  Depth-first  search  can  be 
used  to  check  for  biconnectivity  [20],  while  the  idea  behind  breadth-first  search  can  be 
used  to  find  minimal  paths  in  weighted  graphs  [31].  The  precise  form  of  the  tree 
constructed  depends  on  the  order  in  which  the  edges  are  represented.  .A.lso,  if  the  graph 
were  not  connected  then  both  of  these  algorithms  visit  those  nodes  of  G  which  lie  in  the 
connected  component  containing  the  root  node  Uq.  Thus  these  algorithms  can  check  for 
the  connectivity  of  G,  or  can  find  all  of  its  components  should  it  be  disconnected. 

4.4  EXAMPLES 

As  an  example,  in  Figures  4-1  and  4-2  below,  we  show  ho\v  the  depth-first  and 
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breadth-first  methods  might  be  implemented  in  Pascal  when  using  an  adjacency  matrix 
representation  of  the  graph.  The  algorithm  outputs  given  in  Figure  4-3  and 
diagrammed  in  Figure  4-4  show  their  operation  on  the  (8,  20)-graph  whose  edge  list  is 
shown.  The  edges,  starting  from  node  are  shown  in  the  order  processed  by  the 
computer.  Thus  in  the  depth-first-search  algorithm  the  edge  from  node  3  to  node  4  is 
the  first  edge  added,  since  node  4  is  the  first  vertex  from  which  the  search  has  to 
backtrack  from  a  dead  end.  Conversely,  the  edge  from  node  1  to  node  2  is  the  last  edge 
added  to  the  tree,  although  it  was  the  first  link  traversed  by  the  algorithm.  The 
breadth-first-search,  however,  also  traversed  this  link  first  but  adjoined  it  to  the  tree 
immediately.  Note  that  these  two  algorithms  are  both  of  complexity  0{p^),  due  to  the 
use  of  loops  for  the  adjacency  matrix  representation.  For  sparse  matrices  the  use  of 
adjacency  lists  can  do  much  better,  say  0{p  +  q). 


24 


Graph  Algorithms 


CONST 

MAXV  =  20;  (*  Maximum  number  of  graph  nodes  »=) 


TYPE 

NodeSet  =  Set  OF  1..MAXV: 

NodeList  =  Array[l..MAXV]  OF  Integer; 
AdjMat  =  Array[l..MAXV]  OF  NodeList; 


VAR 

A  :  AdjMat:  (*  A  =  the  adjacency  matrix  of  the  graph  «) 

NodeNum  :  Integer;  (’f  NodeNum  =  number  of  nodes  in  the  graph  *) 

(* . . - . . . . . *) 


PROCEDURE  DFS(Root  :  Integer); 

(*  Depth-First  Search  of  graph  from  root  node  >“) 
VAR 

i  :  Integer: 

Tag  :  NodeList: 

("' . *) 

PROCEDURE  Visit(k  :  Integer); 

(*  Recursive  traversal  from  node  k  *) 

VAR 

i  :  Integer; 

BEGIN 
Tag[k]  :=  1; 

FOR  i  :=  1  TO  NodeNum  DO 
IF  (A[i.k]  <>  0)  AND  (Tag[i|  =  0)  THEN 
BEGIN 
Visit(i); 

Writeln(’Edge:  \  k.  '  ->  \  i); 

END; 

END;  (*  Visit  *) 

(* . ‘) 

BEGIN  (*  Start  of  DFS  «) 

FOR  i  :=  1  TO  NodeNum  DO 
Tag[i]  :=  0; 

Visit(Root); 

END;  (*  DFS  *) 


NOTES:  This  is  a  recursive  version  of  the  depth-first  search.  The  Tag  vector 
is  simply  a  Boolean  labelling  of  the  nodes,  where  Tag[v]  =  0  if  and  only  if 
node  V  has  not  yet  been  visited.  The  edges  are  written  out  in  the  reverse 
order  of  that  in  which  they  are  traversed.  Bookkeeping  is  kept  to  a  minimum 
in  this  version  of  the  algorithm. 


FIGURE  4-1  DEPTH-FIRST  SEARCH  ALGORITHM 
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CONST 

MAXV  =  20;  (»  Maximum  number  of  graph  nodes  *) 

TYPE 

NodeSet  =  Set  OF  1..MAXV: 

NodeList  =  Array[l..MAXV)  OF  Integer; 

AdjMat  =  Array[l..MAXV]  OF  NodeList; 


VAR 

A  ;  AdjMat:  (*  A  =  the  adjacency  matrix  of  the  graph  *) 

NodeNum  :  Integer;  (»  NodeNum  =  number  of  nodes  in  the  graph  *) 

(* . -  . . . *) 


PROCEDURE  BFS(Root  :  Integer); 

(*  Breadth- First  Search  of  graph  from  root  node  *) 

VAR 

Tag  ;  NodeList: 

Next  ;  NodeSet: 
i,  j,  Count  :  Integer: 

BEGIN 
Count  :=  1; 

FOR  i  :=  1  TO  NodeNum  DO 
Tag[i]  ;=  0; 

TagjRoot]  :=  Count; 

WHILE  TRUE  DO 
BEGIN 
Next  [Jj 

FOR  j  :=  1  TO  NodeNum  DO 
FOR  i  :=  1  TO  NodeNum  DO 
BEGIN 

IF  (Tag[i]  =  0)  AND  (TagO]  =  Count)  AND  (A[i,j]  <>  0)  THEN 
BEGIN 

Next  :=  Next  -t-  [i|; 

Tag[i]  ;=  Count  +  1; 

Writeln(’Edge:  \  j,  '  ->  i); 

END: 

END: 

IF  (Next  =  [])  THEN  Exit; 

Inc(Count): 

END;  (*  WHILE  «) 

END;  (*  BFS  *) 

NOTES:  This  is  a  non-recursive  version  of  the  breadth-first  search.  The  Tag 
vector  here  will  label  the  vertices  according  to  their  distance  from 
the  initial  node,  where  again  Tag(v]  =  0  implies  that  node  v  has  not 
yet  been  visited.  If  the  NodeSets  Next  were  accumulated,  then  the  final 
result  would  contain  those  nodes  in  the  connected  component  of  G  which 
contains  the  root  node. 


FIGURE  4-2  BREADTH-FIRST  SEARCH  ALGORITHM 
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Example  Graph: 

Root  Node:  1 

No.  of  Nodes:  8 
No.  of  Links:  20 

Edge  Listings: 

1  2 
1  4 

1  5 

2  1 

2  3 
2  0 

3  2 
3  4 

3  7 

4  1 

4  3 

5  1 
5  6 

5  8 

6  2 

6  5 

7  3 

7  8 

8  5 
8  7 


Algorithm  Outputs: 


DFS:  Root  =  #1 


Edge:  3  -->  4 
Edge:  5  ->  6 
Edge:  8  ->  5 
Edge:  7  ->  8 
Edge:  3  —  >  7 
Edge:  2  ->  3 
Edge:  1  ->  2 


BPS:  Root  =  #1 


Edge:  1  ->  2 
Edge:  1  ->  4 
Edge:  1  ->  5 
Edge:  2  ->  3 
Edge:  2  -->  6 
Edge:  5  -->  8 
Edge:  3  ->  7 


FIGURE  4-3  EXAMPLE  OUTPUTS  OF  SEARCH  ALGORITHMS 


Graph  Algorithms 


Original 
Graph  G 


Depth  -  First 
Search  Tree 


Breadth  -  First  y  ^ 
Search  Tree  I  • 


NOTE;  Max  {dist( u.VgYu  G  V^G)}  =  3  in  the  original  graph  and  in  the  breadth- 
first  search,  but  this  value  has  increased  to  6  with  depth-first  searching 

FIGURE  4.4  GRAPH  TRAVERSALS  AND  SPANNING  TREES 
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4.5  OTHER  GRAPH  ALGORITHMS 

Other  important  polynomial-time  graph  algorithms  are  those  of  Prim  and  Kruksal 
for  constructing  minimal  spanning  trees  of  a  connected  graph  starting  from  an  initial 
root  node.  Again,  both  of  these  algorithms  are  0{p~)  when  the  graph  is  represented  by 
an  adjacency  matrix.  Also,  shortest-path  algorithms  are  basically  breadth-first  searches 
(as  is  Prim's  algorithm)  and  give  constructions  such  as  Dijkstra's  method  and  shortest- 
path  spanning  trees. 

Transitivity  problems  also  arise  here.  The  transitive  closure  G*  =  G{V,  E‘)  of  G  is 
the  graph  for  which  e  =  uv  G  E"  if  and  only  if  there  is  a  path  from  u  to  v  in  G.  Here 
VVarshall's  method  (compute  the  adjacency  matrix  of  G",  gi\'en  that  of  G)  and  Floyd’s 
algorithm  (calculate  the  minimal  cost  of  each  e  £  E”)  are  used,  both  of  which  are  of 
order  O(p^). 

Other  :VP-complete  graph  problems  include  longest  path  problems,  minimum 
vertex  colorings,  maximal  clique  size  (largest  completely  connected  subgraph),  and 
minimum  vertex  coverings  (smallest  set  of  nodes  to  which  every  edge  is  incident).  No 
polynomial  time  algorithms  are  known  for  any  of  these  questions,  and  the  most 
commonly  accepted  opinion  is  that  no  such  algorithms  exist.  That  is,  the  .VF-complete 
problems  probably  have  no  polynomial  time  solutions. 

For  .VF-complete  problems  a  polynomial-time  algorithm  can  now  be  found  only 
when  some  restrictions  are  made  on  the  inputs  allowed.  For  example,  if  a  graph  G 
satisfies 

deg(ti)  +  deg(c)  >  P  (4-1) 

for  all  non-adjacent  nodes  u.  i’  G  V[G),  then  it  is  known  that  G  has  a  Hamiltonian  cycle. 
In  this  case  there  is  an  0{p^)  algorithm  which  will  find  such  a  cycle,  but  this  will  fail  for 
non- Hamiltonian  graphs  or  for  some  Hamiltonian  graphs  which  do  not  satisfy  the  above 
condition  (which  is  not  a  necessary  condition  for  Hamiltonicity).  See  Buckley  and 
Harary  [3]  for  a  description  of  such  an  algorithm. 
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5.  PROBABILISTIC  NETWORKS  AND  RELIABILITY 

Communication  networks  are  generally  modelled  by  weighted  digraphs  in  which  the 
nodes  represent  communication  centers  (transmitters,  receivers,  relays)  and  the  edges 
are  communication  channels  between  the  nodes.  The  weight  associated  with  each  edge 
is  taken  to  be  the  probability  of  that  edge  being  operational.  .A.  reliability  measure  on 
such  a  network  is  then  the  expected  value  (in  the  probabilistic  sense)  of  some  aspect  of 
the  connectivity  of  the  underlying  graph.  In  particular,  we  are  interested  in  the 
probability  that  a  path  exists  between  two  given  nodes,  or  the  probability  that  the 
network  is  connected  [14]. 

This  section  describes  some  of  the  reliability  measures  of  interest  and  discusses  the 
problems  involved  in  calculating  them.  Specific  methods  used  for  reliability 
computations  are  considered  in  the  following  section.  Unfortunately,  most  problems  in 
reliability  are  iVP-hard  [39-47].  While  it  is  possible  to  obtain  polynomial  time 
algorithms  for  reliability  calculations  of  some  restricted  classes  of  networks,  such 
restrictions  are  far  too  strong  to  be  applied  to  mobile  communication  networks 
operating  in  stressful  environments. 


5.1  MODELING  OF  COMMUNICATION  NETWORKS 

We  shall  take  as  our  model  of  communication  networks  a  weighted  digraph 
G  =  G(V',£')  with  the  weight  function  P  :  E{G)—^R  interpreted  as  the  probability  of  the 
edges  being  operational.  (That  is.  1  —  P{e)  is  the  probability  that  edge  e  G  E{G)  has 
failed.)  It  is  not.  in  general,  assumed  that  P{e)  =  P{e')  for  e  G  E{G),  where  e'  G  E{G)  is 
the  reversal  of  e.  If  this  is  always  the  case  then  we  will  refer  to  G  as  a  symmetric  or 
undirected  network.  Many  of  the  results  in  the  literature  apply  only  to  this  case,  which 
does  not  model  some  local  node/link  behavior,  such  as  directional  antennas  for 
transmission  and  reception. 

In  what  follows  we  shall  concern  oiirselves  primarily  with  link  failures,  which  has 
been  the  situation  dealt  with  most  extensively  in  the  literature.  That  is.  we  initially 
assume  that  the  nodes  are  completely  reliable,  so  we  do  not  consider  a  corresponding 
probability  function  defined  on  the  set  of  vertices  of  G.  The  problems  posed  by  node 
failures,  however,  are  of  importance  in  mobile  communications  networks,  so  we  will 
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return  to  this  question  I)elo\v.  Of  most  significance  here  is  how  some  of  the  methods 
which  have  been  developed  to  treat  only  edge  failures  can  easily  be  modified  to  take 
into  account  the  possibility  of  node  failures  as  well. 

An  important  assumption  usually  made  at  this  point  is  that  all  edge  failures  (and 
node  failures,  if  these  are  included  in  the  model),  are  statistically  independent.  This 
assumption  greatly  simplifies  the  reliability  calculation,  but  is  unrealistic  in  many 
situations,  such  as  when  some  communication  equipment  is  shared  by  several  links. 
However,  the  modeling  of  dependent  equipment  failures  generally  requires  the  use  of 
conditional  and/or  joint  probability  distributions,  of  which  there  are  potentially  an 
exponentially  large  number  [4S-56].  Since  most  reliability  problems  are  known  to  be 
.VP-hard,  even  in  the  enumerative  sense,  this  greatly  increases  the  difficulty  of  the 
computation,  and  we  will  assume  independent  component  failures  in  what  follows. 

5.2  RELIABILITY 

.As  described  previously,  network  reliability  is  a  measure,  now  in  the  probabilistic 
sense,  of  the  connectivity  of  a  network.  The  most  common  statistic  used  is  the  so-called 
K-terminal  reliability  where  A'  C  V{G)  is  a  distinguished  subset  of  the  vertices  of  the 
graph.  The  reliability  problem  is  to  calculate  the  probability  that  every  two  vertices  in 
I\  are  connected  by  a  path  of  operational  edges.  Of  particular  interest  here  is  the  2- 
terminal  reliability,  when  |A'|  =  2,  and  the  all-terminal  reliability,  where  A'  =  V{G). 

For  2-terminal  reliability  we  have  K  =  {s.t}  where  s,t  6  V{G)  are  referred  to  as  the 
.source  and  terminal  (or  sink)  respectively.  The  st-reliability  of  the  network  is  the 
probability  of  the  source  being  able  to  communicate  to  the  sink.  Note  that  if  we  have  a 
non-symmetric  network  it  is  important  which  node  is  the  source  and  which  is  the  sink, 
as  we  are  asking  for  the  probability  of  a  working  path  from  the  source  to  the  sink.  The 
■st-reliability  for  an  undirected  network  can  be  considered  as  a  special  case  of  that  for  a 
directed  network  (each  edge  is  equivalent  to  two  edges  with  opposite  directions  but 
equal  probability  of  failure).  The  st-reliability  will  be  denoted  by  ReI(G;s,  t). 

For  an  undirected  network  the  all-terminal  reliability  is  simply  the  probability  that 
the  network  is  connected,  or.  what  is  the  same,  the  probability  that  the  graph  G 
contains  at  least  one  operational  spanning  tree  or  subgraph.  For  directed  networks  the 
all-terminal  reliability  measures  the  probability  that  the  network  is  strongly  connected. 
It  seems,  however,  that  strong  connectivity,  even  in  the  probabilistic  sense,  is  too 
restrictive  a  requirement.  For  e.xample.  using  local  jamming  it  might  be  relatively  easy 
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to  isolate  a  particular  node  on  the  periphery  of  a  graph  so  that  it  could  no  longer  receive 
messages  from  the  rest  of  the  network.  Even  if  transmissions  from  this  node  could  be 
relayed  throughout  the  network  to  all  the  other  nodes,  the  reliability  of  the  network  in 
the  strong  sense  would  be  zero.  This  should  be  considered  too  pessimistic  a  reliability 
measure  for  the  overall  performance  of  such  a  network.  The  all-terminal  reliability  will 
be  denoted  by  Rel(G),  and  for  an  arbitrary  K  C  E(G)  we  let  Rel(G;A')  denote  the  K- 
reliability  of  G. 

Since  we  cannot  abandon  the  use  of  directed  networks  if  we  want  to  model  any 
direction-dependent  behaviour  in  the  network  links,  the  all-terminal  reliability  is  not  an 
appropriate  statistic.  Instead  we  shall  concentrate  on  the  2-terminal  st-reliability.  If  a 
global  figure  for  the  network  as  a  whole  is  required,  we  could  use  the  average  of  these 
reliabilities  over  all  possible  /;(p  —  1)  node  pairs. 

Res(G)  y  Rel(G:s.f).  (5.1) 

the  so-called  resiliency  [63]  of  G.  This  measures  the  fraction  of  all  possible  source-sink 
pairs  for  which  communication  paths  can  be  established.  (Of  course,  we  have 
Rel(G;s.f)  =  Rel(G;<.s)  if  G  is  a  symmetric  network.) 

5.3  SPECIAL  CASES 

It  is  an  unfortunate  fact  of  life  that  the  calculation  of  any  of  these  reliability 
measures  for  an  arbitrary  communications  network  is  an  A'F-hard  problem.  This 
remains  true  even  if  a  number  of  additional  simplifying  assumptions  are  imposed.  For 
example,  one  might  (unrealistically)  require  that  all  link  probabilities  are  equal,  say 
P(e)  =  .^0  for  all  e  G  E(G),  where  0  <  .ig  <  1  is  a  constant.  The  reliability  calculations 
can  then  be  reduced  to  problems  in  graph  enumeration.  Let  d/*.  be  the  number  of  paths 
from  .s  to  t  of  length  k.  0  <  k  <  q.  Then  we  have 

Rel(G;,s.t)=  J3.U,4(l-.^)‘^-'-  (5.2) 

k  >  1 

.Analogously,  if  is  the  number  of  spanning  subgraphs  of  G  having  k  edges,  then 

Rel(G)=  ^  (5-3) 

k  >  1 

However,  even  the  graph  enumeration  (piestions  of  determining  the  coefficients  .1/j.  and 
are  .VP-hard  [47]. 
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An  alternative  simplification  is  to  restrict  the  class  of  networks  under  consideration 
[57-67],  say  to  those  whose  underlying  graph  is  planar.  (That  is,  the  nodes  and  edges 
can  be  drawn  in  the  xy-plane  in  such  a  way  that  no  two  edges  intersect,  except  at  a 
node  of  the  graph,)  However,  the  reliability  problem  is  also  .VP-hard  for  planar 
networks  [5S],  This  is  even  true  for  grid  networks  where  the  nodes  are  located  at  the 
points  of  a  regular  square  lattice  in  the  xt/-plane  and  each  node  has  direct  edge 
connections  only  with  its  nearest  neighbors  [62],  While  it  is  possible  to  still  further 
restrict  the  networks  being  considered  so  as  to  obtain  a  class  (e.g  ,.  series-parallel 
graphs)  of  graph  inputs  for  which  polynomial  time  reliability  algorithms  do  exist,  such 
restrictions  are  too  severe  for  our  purposes. 

Indeed,  for  mobile  communication  networks  it  is  not  realistic  to  restrict  the 
possible  topology  or  node  interconnectivity  in  any  way  whatsoever.  The  links  in  such 
networks  are  constantly  changing  as  the  nodes  move  in  and  out  of  range  of  one  another. 
Moreover,  under  battlefield  stress  conditions,  such  as  local  jamming  or  attack,  any  link 
or  node  is  subject  to  failure.  In  principle,  then  any  (p,  g)-digraph  could  occur  as  the 
underlying  graph  of  a  communications  network.  For  the  same  reasons,  no  a  priori 
restrictions  could  be  placed  on  the  values  of  the  edge  failure  probabilities. 

5.4  RELIABILITY  ALGORITHMS 

Thus  any  algorithms  to  be  considered  for  use  in  our  network  reliability  calculations 
should  be  able  to  deal  with  a  completely  general  network  with  no  constraints  imposed 
on  either  the  available  edge  connections  or  on  the  values  of  the  link  probabilities.  .A 
number  of  such  methods  have  been  developed  in  recent  years,  and  improvements  are 
constantly  being  made.  We  will  discuss  a  few  of  these  in  some  detail  in  the  next 
section,  but  the  reader  should  take  note  of  the  various  survey  articles  on  network 
reliability  that  have  been  previously  published  [68-82], 

Not  all  of  these  bear  directly  on  communications,  some  being  either  too  specialized 
or  emphasizing  some  aspect  of  reliability  of  less  immediate  interest  to  us,  but  all  are 
worth  perusing  if  only  to  get  a  different  point  of  view.  Thus  [80]  is  concerned  almost 
exclusively  with  planar  networks,  much  too  restrictive  a  condition  to  be  required  of  a 
mobile  communications  network.  .Another  example  is  [69],  which  stresses  the 
applications  of  reliabilitv  criteria  in  the  synthesis  of  communication  networks.  From 
this  point  of  view  the  node/edge  configuration  of  the  graph  is.  at  least  to  .some  extent, 
at  the  disposal  of  the  designer  and  can  be  adapted  as  needed  to  satisfy  minimal 
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reliability  or  vulnerability  requirements. 

One  topic  that  has  not  been  mentioned  yet  but  which  will  obviously  be  of  great 
interest  in  the  future  is  the  application  of  parallel  processing  techniques  to  solving 
graph-theoretic  problems.  Here  the  computer  consists  of  many  separate  processing 
elements,  each  working  simultaneously  on  its  own  part  of  the  problem.  With  a  large 
number  oi  processors  the  time  required  to  compkce  the  computation  will  generally  be 
much  less  than  when  the  data  is  processed  serially  in  the  usual  way  through  a  single 
central  processor. 

For  example,  to  simply  add  up  ,V  numbers  obviously  requires  N  —  1  additions, 
hence  0{.\]  machine  cycles  on  such  a  computer.  However,  if  there  were  available  .V 
parallel  processors,  one  for  each  item  of  data,  then  this  can  be  done  in  0(log2A’^)  cycles, 
an  enormous  saving  for  large  .V.  Similarly,  no  all-purpose  serial  sorting  algorithm  can 
process  .V  items  in  better  than  0(.\  log  .V)  time,  but  much  better  algorithms  can  be 
devised  to  do  this  if  .V  or  more  processors  are  available.  .Another  example  is  that  of 
matrix  multiplication — if  processing  elements  are  available,  then  two  N  x  N 

matrices  can  be  multiplied  in  0(.V)  time. 

Many  of  the  standard  graph  problems  have  been  studied  from  this  point  of  view 
and  parallel  algorithms  are  available  for  various  computer  architectures  [83-89].  These 
include  algorithms  for  finding  spanning  trees,  connected  components,  and  graph 
traversals.  However,  the  uallel  algorithms  are  in  general  not  simply  direct  transfers  of 
the  serial-type  algorithms  to  parallel  form,  for  this  is  not  usually  possible.  Devising  a 
par<allel  algorithm  often  forces  a  completely  new  approach  to  the  problem  in  question 
and  may  recjuire  new  ways  in  which  to  r'‘present  the  data  so  as  to  be  able  to  exploit  the 
parallel  structure  of  the  machine.  Current  research  in  this  area  is  very  intensive. 

It  is  natural  to  expect  parallel  processing  to  be  of  considerable  help  in  reliability 
computations.  Hopefully  we  could  solve  many  of  the  .VP-hard  problems  referred  to 
earlier  in  polynomial  time,  and  for  many  of  them  this  is  in  fact  the  case,  but  only  at  a 
high  price.  For  example,  considi’r  the  Hamiltonian  tour  problem  referred  to  in  the 
previous  section.  With  sufficiently  many  processing  elements,  all  we  need  to  do  is  just 
dedicate  each  processor  to  the  task  of  generating  and  testing  its  own  candidate  solution. 
Sine*"  an  XP  problem,  by  its  very  definition,  admits  to  a  polynomial-time  algorithm  for 
checking  the  validity  of  a  solution,  if  the  problem  has  any  solution  at  all  then  one  of  the 
processors  is  guaranteed  to  find  it.  and  in  polynomial  time.  In  this  sense,  then,  any  XP 
prolilem  can  be  solved  in  polynomial-time.  (We  ignore  any  difficulties  that  might  arise 
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when  processors  need  to  communicate  or  synchronize  with  each  other,  although  this  is  a 
very  important  consideration  in  the  design  of  parallel  algorithms.) 

Of  course  we  now  have  to  devote  an  exponentially  large  number  of  processors  to 
solving  the  problem.  As  this  is  completely  out  of  the  question,  we  must  settle  for 
something  less  than  a  true  polynomial-time  algorithm.  The  minimum  we  want  is  to 
use  parallel  processing  to  obtain  algorithms  that  either 

(i)  process  our  current  networks  much  more  rapidly  than  the  available 
serial  algorithms 

or  (ii)  allow  us  to  deal  with  much  larger  networks  in  the  same  time  we  now 

devote  to  serial  calculations. 

The  extent  to  which  these  goals  are  realizeable  is  still  being  investigated.  In  addition  to 
the  problem  of  obtaining  a  polynomial-time  algorithm,  questions  about  algorithms  using 
only  a  polynomial  amount  of  storage  are  also  of  great  interest.  Dotson  s  method,  (see 
below),  for  example,  requires  an  amount  of  temporary  storage  which  grows 
exponentially  with  the  size  of  the  network. 
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6.  CALCULATION  OF  RELIABILITY 

As  noted  above,  the  problem  of  calculating  2-terminal  st-reliability  is.  in  the  worst 
case.  A'P-hard.  Therefore,  all  general  reliability  calculations  should  be  expected  to  take 
an  amount  of  time  which  grows  exponentially  with  the  size  of  the  input.  Nevertheless, 
some  algorithmic  approaches  to  the  problem  are  much  better  than  others.  Some  of  the 
more  recent  algorithms  require  much  less  time  to  complete  the  computation  than  was 
required  by  earlier  methods.  This  allows  the  exact  reliability  to  be  determined  for 
much  larger  networks  than  were  possible  previously.  In  addition,  the  wide  availability 
of  inexpensive  hardware  with  appreciable  amounts  of  high-speed  storage  permit  many  of 
these  calculations  to  be  done  relatively  quickly  in  a  desktop  envirc  iment  using  personal 
computers. 

In  this  section  we  discuss  a  number  of  the  more  recent  approaches  to  performing 
these  calculations.  However,  for  very  large  networks  no  currently  existing  computer 
machinery  is  even  remotely  capable  of  computing  their  reliability.  For  such  problems 
one  must  resort  to  approximation  methods  to  obtain  upper  and  lower  bounds  on 
Rel(G;s,t),  ideally  in  polynomial  time.  A  number  of  such  methods  are  available  but 
they  tend  to  be  somewhat  heuristic  in  the  sense  that  the  error  is  not  necessarily  known, 
although  some  of  these  do  maintain  current  error  estimates.  Also,  it  is  not  clear  what 
type  of  networks  these  algorithms  are  best  suited  for.  While  a  particular  algorithm  may 
appear  to  do  v'ery  well  for  one  particular  type  of  network,  no  one  can  claim  to  be 
uniformlv  better  than  another.  Direct  comparisons  between  them  which  remain  valid 
over  all  allowable  inputs  are  not  available. 

Finally,  if  the  available  bounds  on  network  reliability  are  not  sufficient,  one  can 
resort  to  Monte  Carlo  simulation  methods.  If  properly  implemented  these  are  capable 
of  maintaining  upper  and  lower  bounds,  but  convergence  is  generally  very  slow. 

6.1  STATE  SPACE  ENUMERATION 

The  most  direct  method  of  evaluating  network  reliability,  or  that  of  any 
probabilistic  Boolean  system,  is  to  enumerate  every  possible  state  of  the  network.  For 
our  communications  networks  there  are  a  total  of  2’’  such  possibilities  (2'’’*''*  if  node 
failures  are  also  considered),  since  each  edge  can  independently  be  either  up  or  down. 
This  decomposes  the  success  or  failure  event  into  its  most  elementary  subsets.  It  is 
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very  easy  to  implement  this  decomposition  and  to  accumulate  the  success  probabilities. 
However,  because  of  the  large  number  of  system  states  of  the  network,  it  is  not  practical 
to  use  such  a  method  to  carry  out  a  general  reliability  calculation  to  its  conclusion. 

Rather,  one  can  generate  the  most  probable  system  states  first  and  accumulate  the 
results  until  most  of  the  state  space  has  been  processed  [90-91].  For  example,  suppose 
that  the  k  most  probable  of  the  2’  elementary  events,  say  Gj  ,03  ,...  .a^.  ,  have  been 
determined.  If  we  then  define 

e  =  e{k)  = 

Gi  G  5 

and 

7  = 

G,  e  5 

where  S  is  the  success  event  (there  exists  an  operating  path  from  s  to  t),  then 
ReI(G;s.  t)  =  P{S)  and 

7<P(S)<7  +  €.  (6.3) 

This  method  would  be  useful  only  if  e  =  rapidly  as  k-*co,  which  requires  that  all 

of  the  network  components  have  a  very  small  probability  of  failure.  Thus  it  is  practical 
for  networks  with  very  reliable  components.  It  can  also  be  used  to  estimate  other 
network  functions  that  would  not  be  readily  available  should  a  much  coarser  partition  of 
the  state  space  be  employed. 

Thus  if  Q  is  some  quantity  defined  over  the  network  and  having  value  Q{a)  when 
the  network  is  in  state  a,  then 

E  P{a.)  Q{a.)  (6.4) 

a,e  s 

is  an  estimate  of  the  average  value  of  Q  over  the  success  set.  Similarly,  for  any 
constant  Qq, 

{  E  /  ( E 

V  «(•(>>  So  '  '  ' 

6  S 

approximates  ths  probability  that  Q  exceeds  Qq,  given  that  the  success  event  occurred. 
If  we  had  used  a  partition  of  S  into  non-elementary  subsets,  the  quantity  Q  might  vary 
significantly  over  such  subsets  and  the  simple  estimates  above  would  not  be  available. 

With  a  little  more  work  it  is  possible  to  maintain  upper  and  lower  bounds  on  such 
estimates,  including  st-reliability.  .4s  such,  these  algorithms  can  monitor  the 
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convergence  of  the  reliability  estimates,  but  the  rate  of  convergence  will  be 
unacceptably  slow  for  all  but  extremely  reliable  systems. 

6.2  INCLUSION-EXCLUSION  METHODS 

The  term  inclusion-exclusion  refers  to  a  class  of  methods  involving  a  decomposition 
of  the  success  or  failure  events  as  the  union  of  more  elementary  success  and  failure 
events,  but  much  coarser  than  an  enumeration  of  the  whole  state  space  [92-100].  As  an 
example,  consider  the  success  event  5.  which  occurs  if  there  is  at  least  one  operating 
path  from  s  to  t.  We  can  imagine  that  the  set  of  such  paths  have  been  enumerated  and 
consider  the  event  5,  that  all  of  the  edges  in  the  /-th  path  are  operating,  so  S  is  the 

union  of  the  events  5,,  say  i  =  I .  M.  The  inclusion- exclusion  principle  is  simply  the 

formula  for  the  probability  of  an  arbitrary  finite  union  of  sets,  we  have 

M 

S=  U  S.,  (6.6) 

i  =  1 

for  which  the  probability  of  success  is 

Rel  (G;s.#)=  F(5)  =  P(  Q  5.)  (6.7) 

t  =  1 

=  i;  05,)-  Y.  05,  n  5,) 

I  t  <  j 

+  Y  0(5,n5,n5,)- ... +(-i)«F(5,n...ns„). 

i  <  j  <  k 

The  failure  set  F  is  handled  in  a  similar  way,  now  using  a  cutset  enumeration.  An 
st-cutset  of  G  is  a  set  EqCE{G)  for  which  G  —  Eq  is  disconnected  with  s  and  t 
belonging  to  different  components  of  G  —  Eq.  It  is  said  to  be  minimal  if  no  proper 
subset  has  the  same  properties.  Suppose  that  the  minimal  st-cutsets  have  been 
enumerated  and  let  F,  be  the  event  that  all  of  the  edges  in  the  z-th  such  cutset  have 
failed,  say  1  <  z  <  N.  Then  F  is  the  union  of  the  sets  F^,....F„  and  the  inclusion- 
exclusion  principle  applies  again; 
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Rel{G;.s./)  =l-P{F)  =  l-P[  U 

i  =  1 

=  i-i:/>(F,)+  Y.  wnf,) 

I  i  <  J 

_  ...  +(_1)'V  +  1  P{F,n...nF_^). 

If  only  a  partial  enumeration  of  the  minimal  ^t-paths  and  ^f-cutsets  were  available. 

say  Si  and  F^  _ ,F„,  where  m  <  M  and  n  <  N,  then  we  can  obtain  upper  and 

lower  bounds  by 

/  .  .n 

P  (  U  )  S  Rel(G;s,!)  <1-P{  U  F,),  (6.9) 

2  =  1  2=1 

and  use  the  inclusion-exclusion  principle  to  evaluate  these  bounds. 

While  there  are  a  number  of  techniques  to  determine  the  minimal  5f-paths  and  si- 
cutsets  [97-100],  this  method  as  described  so  far  has  some  severe  drawbacks.  First,  the 
number  of  these  events,  M  and  N  respectively,  grow  exponentially  with  the  size  of  the 
network.  Even  worse,  the  number  of  terms  generated  by  the  application  of  the 
inclusion-exclusion  principle  is  itself  exponential  again  as  a  function  of  these  quantities. 
Also,  the  various  intersections  of  the  original  success  or  failure  events  frequently  overlap 
or  even  coincide.  This  leads  to  a  large  amount  of  unnecessary  calculation  of  the 
probabilities  of  these  intersection  events,  many  of  which  cancel  with  one  another.  Thus 
a  direct  application  of  these  methods  is  extremely  inefficient  and  should  not  be  used  for 
anything  but  very  small  networks. 

Significant  improvements  can  be  made  in  this  approach  if  one  avoids  the  repeated 
calculation  of  canceling  terms  in  these  expansions  [95,  96).  This  is  possible  because  the 
success  probability 


P{S)  =  P[  jj  5.)  (6.10) 

i  =  1 

can  be  expressed  by  means  of  non-cancelling  terms  which  can  be  put  into  a  one-one 
correspondence  with  certain  subgraphs  of  G.  We  say  that  a  subgraph  Gq  C  G  is  an  st- 
subgraph  if  every  edge  of  Gq  lies  in  some  path  of  Gq  which  connects  the  source  to  the 
sink.  The  non-cancelling  terms  in  P{S)  will  correspond  to  the  acyclic  5t-subgraphs, 
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those  which  have  no  directed  cycles,  while  terms  which  cancel  identically  result  from 
those  which  admit  cycles.  Figure  6.1  shows  the  various  subgraphs  in  a  five-node 
network. 

Hence  in  calculating  P{S)  it  is  only  necessary  to  compute  the  probabilities  P{Ga) 
for  the  acyclic  sf-subgraphs  of  G.  and  there  are  far  fewer  of  these  than  the  number  of 
terms  in  the  inclusion-exclusion  expansion  formula  (although  the  number  of  these 
subgraphs  still  grows  exponentially  with  the  size  of  the  network).  The  acyclic  st- 
subgraphs  can  be  systematically  constructed  by  a  succession  of  edge  replacements,  and 
the  st-reliability  formula  so  generated  is  in  a  factored  form  which,  if  multiplied  out,  is 
equivalent  to  the  sum  of  the  non-cancelling  monomial  terms  given  by  the  inclusion- 
exclusion  form.  However,  the  terms  that  result  here  are  not  all  positive,  so  terminating 
the  process  prematurely  does  not  necessarily  give  a  lower  bound  on  the  reliability. 

These  methods  can  also  be  applied  to  other  measures  of  network  reliability  (e.g.  K- 
terminal.  all- terminal).  A  somewhat  simpler  procedure  for  the  all-terminal  reliability 
Rel(G)  can  be  based  on  the  spanning  trees  of  G  instead  of  st-subgraphs.  This  will  give 
lower  bounds  on  the  reliability,  but  applies  only  to  undirected  networks  [93].  They  offer 
a  tremendous  improvement  over  the  direct  application  of  the  inclusion-exclusion 
principle  and  make  possible  exact  reliability  computation  for  networks  that  were  too 
large  to  be  amenable  to  this  type  of  analysis  by  previous  algorithms.  Nevertheless, 
there  are  more  recent  methods  which  perform  even  more  efficiently  and  which  give 
upper  and/or  lower  bounds  on  the  st-reliability  when  terminated  before  completion  of 
the  calculations. 

It  is  interesting  to  note  [102]  that  if  all  of  the  minimal  st-cutsets  of  G  have  been 
enumerated,  say  \i,  ....  then  there  is  an  algorithm  for  the  st-reliability  that  has 
polynomial  complexity  in  .V.  Specifically,  its  runtime  growth  is  0(j^p  +  q)  While 

it  is  true  that  y  may  grow  exponentially  with  p  and  q,  for  some  networks  it  grows  more 
slowly  than  this  worst-case  behavior  would  indicate.  In  general,  moreover.  N  will 
increase  less  rapidly  with  the  network  size  than  the  totals  for  other  quantities  used  in 
reliability  calculations,  such  as  the  number  of  spanning  trees.  Surprisingly,  however, 
the  corresponding  result  for  paths  is  false  (unless  every  iVP-complete  problem  hcis  a 
polynomial-time  algorithm).  That  is.  if  all  of  the  st-paths  of  G  have  been  collected,  say 

TT, .  TTyj,  there  is  no  st-reliability  algorithm  having  polynomial-time  complexity  as  a 

function  of  M.  In  fact,  this  is  true  even  for  a  class  of  graphs  for  which  M  grows 
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(a)  Non  st-subgraph 

(Not  every  edge  is  in  an  st-path) 


(b)  Acyclic  st-subgraph 

(Every  edge  is  in  an  st-path) 


(c)  Cyclic  st-subgraph 

(Every  edge  is  in  an  st-path) 

FIGURE  6.1  SUBGRAPHS  OF 
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only  polynomially  as  a  function  of  p  and  q,  so  the  exponential  growth  in  the  worst-case 
complexity  of  sf-reliability  calculations  is  not  due  merely  to  the  fact  that  the  number  of 
st-paths  may  be  large. 

6.3  DISJOINT  SUMS 

An  alternative  to  using  the  general  formula  for  the  probability  of  an  arbitrary 
union  of  sets,  as  in  the  inclusion-exclusion  principle,  is  to  express  the  success  or  failure 
event  as  a  union  of  disjoint  sets  [101-104].  For  such  disjoint  unions  the  individual 
probabilities  simply  add  up  directly  with  no  cancellation.  For  an  example  of  how  this 
can  be  done,  consider  the  success  event 

5=  ij  5.  (6.11) 

I  =  1 

as  described  above.  We  can  define  an  associated  sequence  of  success  events  as 

5'  =  (6.12a) 

S' =52  -5,  =  52(1  S,  .  (6-12b) 

5',  =  5,-(5,U_52U---Ufii--.) 

=  5fcn5inS2n...nSfc_,.  k  =  2,...M.  (6.12c) 

(Here  .A  denotes  the  set  complement  of  A  in  the  state  space.)  It  is  clear  that  these 
events  are  mutually  disjoint  and  that  their  union  is  5.  whence 

M 

Rel(G:s.f)  =  P(?)  =  £  P(5')  .  (6.13) 

I  =  1 

Obvdously  a  similiar  procedure  applies  to  the  failure  event,  and  partial  results  can 
be  obtained  by  truncating  the  process  at  any  stage  to  yield  upper  and  lower  bounds  on 
the  reliability.  There  are  a  large  number  of  variations  on  this  theme,  for  there  is  no 
reason  to  enumerate  all  the  success  events  S,  first  and  then  construct  the  disjoint  events 
5;.  It  would  obviously  be  preferable  to  construct  the  disjoint  partition  of  5  into 

S', . S’„  directly  by  first  generating  a  success  event  S,.  then  a  .second  such  set  S'2 

disjoint  from  the  first,  and  continuing  until  the  full  success  event  5  has  been  exhausted. 

This  is  frequently  done  using  Boolean  algebra  constructions  and  inversion 
(set  complementation)  techniques  to  obtain  disjoint  Boolean  forms  [101,  103,  114].  Here 
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the  Boolean  variables  would  be  .Y,,  /  =  1 _ ,q  with  .Y,  true  if  edge  number  i  is 

operating,  and  false  (A',  true)  otherwise.  General  methods  of  Boolean  algebra  can  be 
applied  in  this  setting  so  as  to  attempt  to  minimize  the  number  of  terms  involved  in 
representing  the  Boolean  function  which  represents  a  successful  st-connection.  That  is, 
an  economical  representation  of  the  success  function  would  mean  an  efficient  partition 
of  the  success  event  into  disjoint  subsets,  hence  a  relatively  small  number  of  terms  in 
the  expression  for  Rel(G:s. t).  While  research  is  still  continuing  in  this  area  [109-113], 
one  should  note  that  the  problems  in  Boolean  algebra  associated  with  this  approach  are 
themselves  A' P- hard. 

As  an  example,  consider  that  probability  of  the  failure  event  P,  which  is  initially 
expressed  as  the  (non-disjoint)  union  of  the  minimal  st-cutsets.  The  disjoint  sums  are 
built  up  inductively,  with  each  cutset  contributing  mutually  disjoint  terms  which  are 
themselves  disjoint  with  respect  to  all  the  cutsets  previously  processed.  Earlier 
algorithms  of  this  type  involved  complementing  individual  Boolean  variables  one  at  a 
time,  but  substantially  fewer  disjoint  terms  may  result  if  complemented  products  are 
used.  A  simple  illustration  of  this  is  given  by  the  two  terms 


5i  =  AiAjA'a  and  Sj  =  A^A'j,  (6.14) 

whose  sum  is  to  be  expressed  as  a  disjoint  sum.  Single  term  complementation  of  the 
product  5  gives 

5i  =  ATaTY^  =  a,  +  A,A2  +  A,  A2A^  (6.15a) 

thereby  resulting  in  a  four-term  disjoint  expansion  for  S  =  Si  +  S2,  namely 

S  =  A,A2A3  +  A,A,A5  +  A,A2A,A5  +  XiX^X.X.X,,  (6.15b) 
and  its  corresponding  reliability  expression 


P(5)  =  P1P2P3  +  (1  -  Px}P4P5  +  P\i^  -  P2)P^P5  +  P1P2  (1  -  P3)P4P5,  (6.15c) 

where  p,  is  the  probability  that  the  variable  A'',  is  true.  Using  the  direct 
complementation  of  the  product,  however,  gives  a  two-term  disjoint  sum 

S  =  AiA2A3  +  (A7A^A%A%,  (6.16a) 

and  its  corresponding  reliability 


P{S)  =  P1P2P3  +  (1  -  PtP2P3)P4P5- 


(6.16b) 


Of  course,  these  two  expressions  give  exactly  the  same  reliability.  This  simple 
difference  can  result  in  a  large  decrease  in  the  number  of  terms  needed  in  order  to 
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express  the  failure  event  F  as  a  disjoint  sum.  While  the  number  of  terms  still  exhibits 
exponential  growth,  the  rate  of  growth  is  much  less  [107]. 

6.4  DOTSON’S  METHOD 

Dotson’s  Method  [115-119]  is  an  interesting  variation  on  the  disjoint  sum  approach 
just  described  in  that  it  simultaneously  generates  both  success  and  failure  events  as  it 
proceeds.  As  these  events  are  mutually  disjoint,  the  success  and  failure  probabilities 
can  be  accumulated  by  simple  additions  as  the  algorithm  is  running,  so  there  are  always 
available  both  an  upper  and  a  lower  bound  on  the  st-reliability.  Thus  the  error  in  these 
bounds  can  be  continually  monitored  and  the  algorithm  can  be  terminated  at  any  time, 
such  as  whenever  pre-assigned  convergence  criteria  have  been  satisfied.  If  the  algorithm 
is  run  to  termination  then  a  complete  description  of  both  the  success  and  failure  events 
has  been  computed. 

This  method  seems  to  have  been  somewhat  unjustly  neglected  since  its  original 
publication  [116],  so  we  will  give  an  example  of  applying  it  to  a  simple  network  later. 
We  have  recently  made  extensive  use  of  it  for  st-reliability  calculations,  as  it  has  a 
number  of  advantages  over  some  of  the  procedures  mentioned  above.  The  method 
begins  by  finding  some  initial  path,  say  tt,  from  the  source  to  the  sink.  Such  a  path, 
which  must  exist  if  the  sink  is  reachable  from  the  source  is  then  the  first  element  in  a 
collection  of  success  events.  If  the  length  of  tt  is  k,  then  each  of  the  k  edges  of  tt  are 
successively  complemented  (disabled),  one  at  a  time,  and  these  complemented  events 
are  stored  in  some  temporary  working  array,  say  W. 

In  the  general  stage  of  the  algorithm  we  examine  the-  events  w  CW  one  at  a  time 
and  for  each  such  event  we  search  for  an  s<-path  in  the  graph  =  G  —  E^,  where 
is  the  set  of  edges  in  the  event  w  under  consideration  which  have  been  disabled.  If  this 
is  not  found  then  w  is  added  to  the  (initially  empty)  collection  of  failure  events. 
Otherwise  a  new  s<-path  is  found,  say  It  is  added  to  the  collection  of  success  events 
and  its  complementary  events  stored  in  a  new  temporary  working  array  W.  This 
procedure  continues  until  all  w  6  U’  have  been  so  processed,  at  which  time  the  whole 
routine  is  repeated  again  with  the  newly  constructed  working  array  W  in  place  of  W . 
The  main  point  of  the  Dotson  method  is  the  use  of  the  complementary  events  to 
generate  subsequent  success  and  failure  events,  but  its  effectiveness  depends  on  the 
particular  path-finding  method  employed.  In  particular,  it  is  important  to  generate  the 
most  probable  success  events  early  in  the  process,  and  efficient  path-finding  algorithms 
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for  this  purpose  are  described  in  [117]  and  [118]. 

If  run  to  completion  the  algorithm  will  terminate  with  W  =  0,  no  further  events  in 
the  temporary  array.  In  this  case  the  success  and  failure  events,  5  and  F,  are 
completely  exhausted  by  the  success  and  failure  collections  which  have  been 
accumulated  during  the  above  procedure.  The  sf-reliability  can  then  be  computed  from 
either  of  these  collections  as 

Rel  (G; s,t)  =  P(S)  =  1  -  P(F)  ,  (6.17) 

or  these  probabilities  could  have  been  continually  updated  as  the  algorithm  proceeded 
to  monitor  the  error.  The  relative  efficiency  of  Dotson’s  method  is  that  the 
success/failure  partitioning  of  5  and  F  constructed  thereby  are  rather  coarse.  For 
example,  if  there  is  an  edge  in  E(G)  which  connects  s  and  t,  then  this  edge  is  the  first 
success  event  rr  found  and  it  is  thereafter  never  considered  again,  immediately  reducing 
the  size  of  the  state  space  by  a  factor  of  2. 

The  main  drawback  to  Dotson's  method  is  that  the  intermediate  storage  required 
for  the  temporary  arrays  may  be  very  large.  (Of  course,  the  number  of  success  and 
failure  events  found  by  the  algorithm  will  also  be  large,  but  these  need  not  be  stored  if 
the  probability  updates  are  done  right  away.)  Whether  this  is  true  or  not  depends  on 
the  order  in  which  the  events  in  the  working  array  are  processed.  If  this  array  is 
implemented  as  a  simple  stack  (LIFO),  then  the  number  of  temporary  events  that  must 
be  stored  does  not  grow  exponentially  with  the  network  size  and  there  is  no  problem 
with  intermediate  storage. 

However,  Dotson's  method  should  find  the  shorter  st-paths  earlier  in  the  search, 
and  since  these  are  the  more  probable  it  is  desirable  to  process  them  early  in  the 
algorithm  [99].  This  requires  operating  the  temporary  storage  array  as  a  queue  (FIFO), 
and  this  will  result  in  large  storage  arrays  whose  size  will  grow  exponentially  with  the 
size  of  the  network.  Moreover,  if  the  algorithm  is  to  be  terminated  early  then  FIFO 
operation  must  be  employed.  (Stack  operation  would  leave  highly  likely  e%'ents  still 
unprocessed.) 

Figure  6.2  gives  an  example  of  Dotson's  method  applied  to  a  simple  4-node  network 
with  all  edges  given  a  probability  of  0.8  of  being  operational.  There  are  four  success 
events  and  five  failure  events  and  the  nine  corresponding  network  configurations  cire 
shown  in  Figure  6.3.  Note  that  none  of  the  nine  subgraphs  shown  admit  cycles.  Also, 
each  of  the  four  success  events  gives  a  subgraph  of  the  network  in  which  t  is  reachable 
from  s,  but  not  all  of  these  are  .st-subgraphs.  In  general.  Dotson's  method  generates 
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Number  of  Nodes:  4 

Number  of  Links:  5 

Source  Node:  1 

Terminal  Node:  4 

Max  Hop  Distance:  3 

Edge  List: 

1  2 

1  3 

2  3 

2  4 

3  4 

Output  of  Dotson  Algorithm  -  Summary 

>>>  Next  working  queue  size:  1  <<< 
>>>  Next  working  queue  size:  2  <<< 
>>>  Next  working  queue  size:  4  <<< 
>>>  Next  working  queue  size;  2  <<< 


Number  of  Successes;  4 

Number  of  Failures:  5 

Number  of  Events:  9 

Link  Probability:  0.80 

Lower  Bound:  '  0.8908800 

Upper  Bound:  0.8908800 

st-Reliability  0.8908800 


Output  of  Dotson  Algorithm  -  Success/ Failure  Events 

+  21121 
+  02112 
+  22102 
-  00111 
-  02110 
+  20202 
-  22100 
-  20001 
-  20200 

Shows  the  edge  configuration  of  the  accumulated  events 
Success  event  positive  (  +  ),  failure  event  negative  (-) 
Working  =  '2’,  not  working  =  O’,  doesn’t  matter  =  T’ 
Edges  are  enumerated  as  ordered  in  the  input  edge  list 


FIGURE  6.2  EXAMPLE  OF  DOTSON'S  METHOD 
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SUCCESS  EVENTS  FAILURE  EVENTS 


FIGURE  6.3  EXAMPLE  OF  DOTSON’S  METHOD  (  CONT'D  ) 
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fewer  success  events  than  there  are  .sf-subgraphs.  as  used  in  the  inclusion-exclusion 
methods. 

The  method  has  some  advantages.  For  one,  it  is  easy  to  restrict  the  hop  length 
used  in  the  path  search  routines  [117.  118].  Thus  for  any  integer  n.  I  <n  <  q,  we  can 
use  the  method  to  estimate  the  n-hop  reliability,  Rel„(G;  s,  f),  the  probability  that  the 
nodes  s  and  t  are  connected  in  G  by  a  path  of  length  at  most  n.  This  is  useful  if  for 
some  reason  there  is  a  limit  on  the  number  of  relays  which  can  be  used  in  the  node-to- 
node  transmission  of  a  message. 

-Also,  it  is  easy  to  account  for  possible  node  failures  in  this  algorithm,  although 
doing  this  directly  is  not  a  good  idea.  (Adding  node  failures  to  the  original  network 
model  increases  the  size  of  the  state  space  by  a  factor  of  2^.)  Much  more  efficient  is  the 
equivalent  links  method  [117,118],  whereby  one  runs  a  modified  version  of  the  Dotson 
method  for  a  network  initially  modelling  only  edge  failures,  but  instead  of  accumulating 
the  success/failure  probabilities  we  will  save  the  success/failure  events  to  some 
permanent  storage  file.  This  file  can  then  be  fitted  to  any  edge  probability  function 
P:  E{G)--*  IR  to  compute  or  bound  the  st-reliability.  Moreover,  if  we  are  also  given  a 
node  probability  function  (5:  V’(G)— ♦R.  then  it  is  also  possible  to  hack- fit  the  edge  events 
generated  by  the  Dotson  algorithm  to  account  for  both  node  failures  and  edge  failures. 
This  can  be  done  at  any  time,  even  if  there  had  originally  been  no  intention  of 
modelling  node  failures,  because  of  the  concept  of  an  “equivalent  link'’  that  combines 
edge  failures  and  node  failures. 

A  brief  description  of  the  equivalent  links  method  follows.  Assuming  (temporarily) 
that  the  source  node  s  is  completely  reliable  (Ql'?)  =  1),  we  consider  that  a  successful 
communication  occurs  across  a  link  exactly  when  both  the  edge  in  question  and  its 
terminal  (receiving)  vertex  are  operational.  This  is  equivalent  to  replacing  the 
probability  P{e)  of  edge  e  =  uc  €  E[G)  by  the  product  Q(r)P(e).  Now,  however,  two 
links  are  independent  only  if  they  have  distinct  terminal  vertices.  Therefore,  if  5  is  any 
success  event  generated  by  Dotson's  method,  then 

P(5)  =  g(s)  n  (6-18) 

f  6  V(G) 

where  5(r)  is  the  restriction  of  the  event  5  to  those  links  which  terminate  at  verte.x  v. 

Given  that  node  r  is  operational,  the  links  in  5(r)  are  operational  exactly  when  the 
corresponding  edges  in  the  original  network  are  operational,  and  this  observation  allows 
us  to  express  the  probability  of  the  event  S{v)  in  terms  of  the  original  node  and  edge 
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probabilities.  Let  and  S_{v),  be  the  set  of  links  in  S{v)  which  are  specified  by  S 

to  be  operational,  and  to  have  failed,  respectively.  Let  M  =  |  S^(v)  |  and 
N  =  I  5'_(n)  I  .  Then  the  probability  of  the  restricted  event  S{v)  is  given  by  the 
following  back-fitting  forumulas: 


P[S(i-))  = 


Q(v)  n 

e  e  S^(v) 


M  =  0,N  =  0 


M>o.y  =  0 


(6.19) 


1  -  Qiv) Qiv)  n  v\/  =  0,.V>0 

e  6  S_{v) 

Q{v)  n  Pi^)  n  [i-w]  M>o,.v>o 

e6S^(u)  eeS_(v) 

Of  course  if  the  nodes  are  completely  reliable  =  1  for  all  v  G  V{G)^  this  gives 
the  usual  probability  of  the  success  event, 


p{S)  =  n  pi^)  n  (6-20) 

e  6  e  6  5_ 

where  5+  (resp  S_)  are  those  edges  in  5  which  are  required  to  be  operational  (resp.  to 
have  failed). 


Similiar  back-fitting  formulas  apply  to  the  failure  events  generated  by  the  Dotson 
method,  except  that  at  the  end  of  the  calculations  one  should  subtract  the  accumulated 
failure  probability  from  Qis)^  instead  of  from  1.0,  to  obtain  the  upper  bound  on  the  st- 
reli  ability. 


The  back-fitting  formulas  (6.19)  from  the  equivalent  links  method  might  also  be 
applied  to  other  .st-reliability  algorithms  which  compute  a  symbolic,  rather  than  a 
numeric,  representation  of  the  reliability  [111,113].  That  is,  provided  sufficient 
information  is  available  in  the  description  of  the  edge  events,  it  would  be  possible  to 
incorporate  node  failures  into  the  result.  With  equivalent  links  method,  in  fact,  a  file  of 
edge  events  generated  using  Dotson’s  method  can  even  be  used  to  calculate  st-reliability 
for  networks  with  perfectly  reliable  edges  and  only  unreliable  nodes. 


6.5  FACTORING  METHODS 


Edge  factoring  [120-126]  is  a  very  attractive  method  for  reliability  calculations  of 
moderate  size  (20-25  vertices).  It  is  based  on  the  fact  that  for  a  given  undirected 
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R{G)  =  P{e)R{G/e)  +  [I  -  P{e)\  RiG  -  e).  (6.21) 

Here  R{G)  is  any  of  the  reliability  measures  defined  above,  and  G/e  denotes  the  graph 
obtained  by  contraction  on  e.  That  is.  we  remove  edge  e  =  uv  from  the  network  and 
identify  the  vertices  u  and  v  which  were  joined  by  e.  In  electrical  engineering  terms  this 
is  best  thought  of  as  replacing  the  edge  e  by  a  short  circuit.  .4nalogously,  G  —  e  is 
obtained  by  the  replacement  of  e  by  an  open  circuit.  (See  Figure  6.4).  Thus  the 
reliability  problem  is  reduced  to  computing  the  reliability  of  two  smaller  networks. 


FIGURE  6.4  CONTRACTION  ON  EDGE  e 

We  will  consider  only  the  2-terminal  reliability  R{G)  =  Rel(G;s,<)  here.  The  edge 
factorization  formula  in  this  case  will  also  apply  to  directed  networks  if  the  method  is 
restricted  to  suitable  edges.  One  should  factor  only  on  an  edge  which  either  goes  out  of 
the  source  or  goes  into  the  sink,  because  the  edges  in  the  opposite  directions  axe  not 
relevant  to  the  s<-reliability  [122].  These  opposite  edges  can  all  be  initially  deleted  from 
the  graph  with  no  effect  on  R{G).  Indeed,  this  should  always  be  done,  not  only  to  the 
original  graph  G,  but  also  to  G/e  and  G  —  e. 

The  factorization  method  is  intrinsically  recursive  in  that  the  process  should  be 
repeated  so  that  G/e  and  G  —  e  are  each  factored  on  one  of  their  edges.  Since  each  step 
reduces  the  size  of  the  graphs  in  question  by  one  edge  (and  one  node  for  G/e),  the 
procedure  will  eventually  result  in  trivial  networks.  However,  in  this  form  the  use  of 
factorization  is  not  practical  since  it  would  result  in  a  binary  tree  of  reduced  graphs  of 
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level  q,  meaning  that  2'^  —  1  such  graphs  would  have  to  be  processed  before  the  recursion 
was  complete.  (Some  branches  may  terminate  early  if  the  source  and  the  sink  become 
disconnected  as  a  result  of  the  edge  deletions,  but  this  will  not  usually  occur.) 

What  makes  this  method  feasible  is  that  it  can  be  combined  with  edge  reduction 
methods  to  greatly  reducf^  the  size  of  the  recursion  tree.  For  example,  contraction  on 
an  edge  frequently  results  in  parallel  edges  (two  edges  incident  on  the  same  two  nodes) 
which  can  be  combined  into  a  single  edge.  Similarly,  parallel  reduction  can  result  in 
serial  edges  for  which  another  reduction  is  possible.  Because  of  the  assumed 
independence  of  the  edge  failures,  the  new  edge  probabilities  can  easily  be  determined 
so  that  the  reliability  of  the  reduced  network  is  the  same  as  that  of  the  original.  These 
edge  reductions  are  shown  in  Figure  6.5. 


FIGURE  6.5  PARALLEL  AND  SERIES  EDGE  REDUCTION  {w  ^  s,t) 

Additional  reductions  are  often  possible,  for  if  u  ^  s  is  such  that  indeg(u)  =  0  (u  is  a 
false  start  node)  or  if  u  t  has  outdeg(u)  =  0  (u  is  a  dead  end  node)  then  v  and  its 
incident  edges  can  be  deleted  from  the  network.  Similarly,  any  pendant  node  other 
than  the  sink  and  the  source  may  be  removed.  None  of  these  deletions  has  any  effect  on 
the  network  reliability  (see  Figure  6.6). 
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FIGURE  6.6  FALSE  START.  DEAD  END.  PENDANT  NODES  {v  ^  s.t) 


Finally,  for  directed  networks  an  important  type  of  reduction  can  be  used  on 
vertices  which  have  only  one  incoming  or  outgoing  edge  [123].  Here,  if  the  incoming 
edge  has  an  outgoing  edge  in  the  opposite  direction,  then  that  outgoing  edge  may  be 
removed  if  the  node  in  question  is  not  the  source  node.  The  other  case  is  similar,  and 
examples  are  si  own  in  Figure  6.7. 


FIGURE  6.7  REMOVE  EDGE  e  IF  u  7^  sA 


The  implementation  of  an  efficient  factorization  algorithm  is  then  to  repeatedly 
apply  to  G  all  the  allowable  edge  reductions  until  no  further  simplifications  are  possible. 
One  then  applies  the  factorization  formula  for  some  edge  e  ^  E(G).  say  one  going  out  of 
the  source  node,  and  repeats  this  process  to  G/e  and  G  —  e  before  they  are  factored  on 
one  of  their  edges.  The  result  is  often  a  very  rapid  loss  of  edges  and  a  drastically 
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one  of  their  edges.  The  result  is  often  a  very  rapid  loss  of  edges  and  a  drastically 
pruned  binary  tree  for  the  recursive  function  calls.  For  example,  if  either  Gje  ox  G  —  e 
is  ever  a  series-parallel  graph,  then  the  edge  reduction  wih  reduce  it  to  a  trivial  graph 
and  factorization  is  never  applied  again.  While  it  is  difficult  to  predict  in  advance  how 
great  a  reduction  in  the  recursion  tree  actually  takes  place,  these  methods  have  proved 
to  be  quite  competitive  with  those  already  discussed. 

One  great  advantage  of  this  method  is  that  very  little  computer  memory  is 
required.  Even  when  written  in  a  recursive  form  in  a  language  such  as  Pascal  which 
supports  recursive  function  calls  and  uses  available  stack  space  for  the  storage  of  local 
variables,  memory  availability  should  not  be  a  problem.  Roughly  speaking,  the  worst- 
case  memory  requirement  is  0{q^),  l)ut  this  will  be  completely  dominated  by  the 
exponential  growth  in  the  runtime. 

A  disadvantage  is  that  a  change  in  the  edge  probabilities  require  that  the  entire 
algorithm  be  repeated.  With  edges  being  merged  with  one  another  or  factored  out  of 
the  calculations,  the  edge  probabilities  are  being  combined  in  an  analogous  way  and  the 
algorithm  does  not  keep  track  of  just  how  this  was  done.  While  it  might  be  possible  to 
save  this  information  so  as  to  reprocess  the  graph  with  a  new  set  of  edge  probabilities, 
doing  so  would  require  a  large  amount  of  storage  to  record  all  the  branching  in  the 
recursion  tree,  losing  the  low  memory  advantage. 

Finally,  the  factorization  formula  (6.21)  can  be  modified  to  apply  to  the  situation 
where  both  nodes  and  edges  can  fail  independently,  without  greatly  increasing  the 
complexity  of  the  algorithm  [117.  125].  The  modifications  are  analogous  to  the 
equivalent  link  method  discussed  in  the  previous  section  on  the  Dotson  method.  We 
shall  give  a  brief  description  of  how  this  is  done,  and  as  before  we  let  Q:y(G)— >iR  be  the 
associated  node  reliabilities. 

The  initial  problem  here  is  that  the  contraction  of  G  on  an  edge  e  =  uu  is  to  result 
from  a  graph  in  which  the  communication  from  node  u  to  node  v  is  possible,  hence  u 
and  r  must  be  operational  and  should  be  merged  into  a  new  node  that  is  completely 
reliable.  (That  is,  if  either  u  or  v  had  failed  we  could  not  have  had  a  successful  edge 
connection  across  this  link.)  Thus  the  end  points  u  and  v  as  well  as  the  edge  e  =  uv 
itself  must  all  be  operational,  and  this  occurs  with  probability 

P'ie)  =  Q(u)  Pit)  Q(v)  (6.22) 

because  of  the  statistical  independence.  However,  the  failure  of  the  link  can  be  due  to 
either  the  failure  of  the  edge  itself  or  that  of  either  of  the  two  nodes.  While  we  may  not 
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know  which  component  has  failed,  we  may  delete  the  edge  from  the  graph  anyway  since 
the  failure  of  a  node  implicitly  implies  the  failure  of  all  edges  that  are  incident  to  it. 
Thus  the  factorization  formula  will  hold  if  the  node  reliabilities  of  the  depleted  graph 
G  —  e  are  modified  appropriately,  which  implies  that  we  use 


Q(e)[l-Q(u)P(e)]_Q(f)  -P'je) 
1  -  P'{e)  1  -  P'{e) 


(6.23) 


with  a  similar  modification  for  the  node  u. 

The  immediate  difficulty  with  this  approach  is  that  the  node  failures  in  the  graph 
G—e  are  no  longer  statistically  independent.  We  could  avoid  this  problem  if  we  only 
factored  on  an  edge  e  =  uv  for  which  the  initial  node  u  was  perfectly  reliable,  for  if 
(5(»)  =  1  then  Q'{u)  =  1  and 


Q'iv) 


Qjv)  [i-^(^)] 

1  -  P{e)Q{v) 


(6.24) 


still  describe  statistically  independent  failures.  However,  we  can  always  assume  that 
the  source  node  s  is  perfectly  reliable,  so  we  will  maintain  statistical  independence 
throughout  the  process  if,  as  in  the  Page-Perry  algorithm,  we  always  perform  the 
factorization  on  edges  leaving  the  source  node,  so 

R{G)  =  P'{e)R(Glt)  +  [1  -  P'(e)]i?(G  -  e)  (6.25) 

for  such  edges,  the  edge  and  node  reliabilities  in  the  derived  graphs  G/e  and  G  —  e 
having  been  adjusted  as  described  above.  (The  true  reliability  Q(s)  of  the  source  node 
is  included  as  a  factor  of  the  sf-reliability  at  the  end  of  the  recursion.) 

As  with  the  use  of  the  factorization  theorem  for  edge  failures  only,  a  practical 
algorithm  results  only'  when  it  is  combined  with  the  edge  reduction  schemes  described 
above.  However,  all  of  these  can  still  be  applied,  with  only  minor  modifications,  to 
account  for  node  failures  as  well.  .Aside  from  this  observation,  the  factorization 
calculation  then  proceeds  exactly  as  before. 

This  modification  of  the  factorization  theorem  and  the  equivalent  links  algorithm 
using  Dotson's  method  are  the  best  available  methods  for  computing  sf-reliability  for 
networks  in  which  both  node  and  vertex  failures  are  modelled.  Each  has  its  own 
advantages  and  disadvantages.  The  factorization  technique  is  generally  much  faster 
(when  combined  with  the  appropriate  edge  reduction  algorithms)  and  uses  very  little 


00 


Calculation  of  Reliability 


intermediate  storage,  but  any  change  in  the  edge  and/or  node  reliabilities  require  that 
the  algorithms  be  repeated  from  the  beginning.  The  Dotson  method,  however,  always 
maintains  bounds  on  the  reliability.  Moreover,  if  a  permanent  record  of  the 
success/failure  events  is  maintained  then  these  can  be  fitted  over  and  over  again  to  any 
distribution  of  the  component  reliabilities  without  repeating  the  original  computation. 
.41so.  while  it  is  possible  to  truncate  the  recursion  tree  in  the  factorization  method  so  as 
to  obtain  upper  or  lower  bounds  in  a  smaller  time,  the  method  does  not  allow  a 
continual  monitoring  of  its  progress  as  it  proceeds. 

6.6  .APPROXIMATION  AND  SIMULATION 

Since  the  exact  calculation  of  the  graph  reliability  is  A’P-hard.  one  inquires  after 
efficient  methods  for  computing  upper  and  lower  bounds  [127-138].  We  have  already 
referred  to  some  of  these  in  an  earlier  section  when  we  assumed  a  constant  edge 
probability.  These  reduced  to  some  graph  enumeration  problems,  although  still  NP- 
hard.  There  are  a  number  of  algorithms  of  this  type.  We  discuss  in  some  detail  a  so- 
called  edge-packing  algorithm  which  will  apply  to  the  case  of  arbitrary  edge 
probabilities  [134.  135|.  -As  above,  we  consider  only  the  2-terminal  .st-reliability  and  use 
the  sets  of  st-paths  and  sf-cutsets.  We  conclude  with  some  references  to  Monte  Carlo 
applications  to  reliability  calculations. 

-An  edge-packing  of  a  digraph  G  =  G{E,V)  is  simply  a  collection  {G, .  G„}  of 

subgraphs  of  G  which  have  no  edges  in  common.  Bounds  on  the  st-reliability  can  be 
obtained  by  imposing  appropriate  restrictions  on  the  type  of  subgraphs  considered.  Let 
us  first  suppose  that  each  such  subgraph  is  an  sf-path.  say  and  define  the  probability 
of  its  success  simply  as  the  probability  that  each  edge  is  operational.  Thus,  by  the 
independence  assumption  on  the  link  failure  events. 

p{-k}=  n  (^-26) 

e  G  TTj. 

Now  if  the  nodes  .s  and  t  are  not  connected  in  G  {t  not  reachable  from  s)  then  all  o{ 

the  .^t-paths  rr, . must  fail.  (The  converse  is  false,  however,  since  there  may  be 

other  st-paths  in  addition  to  the  given  t, . Since  these  paths  are  assumed  to  be 

edge- disjoint. 

l-Rel(G:.s.f)  <  H  [l-P(~k)]  (6-27a) 
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SO  this  type  of  edge-packing  gives  a  lower  bound  on  the  st-reliability.  Given  such  a 
lower  bound  we  can  look  to  improve  it  by  attempting  to 

(i)  find  an  additional  st-path  +  i  edge-disjoint  from  tt,,.... 7r„ 

(ii)  replace  one  of  these  st-paths  by  another  which  is  more  reliable  but  still 
edge-disjoint  from  the  remaining  paths. 

These  two  aims  are  somewhat  contradictory,  and  it  is  not  obvious  how  one  should 
choose  an  edge-packing  by  s/-paths  so  as  to  optimize  the  lower  bound.  One  should 
note,  however,  that  the  maximum  number  of  such  paths  is  known  from  a  variation  of 
Monger's  theorem  [3]: 

The  maximum  number  of  edge-disjoint  sf-paths  is  equal 
to  the  minimal  number  of  edges  in  an  sf-cutset. 

However,  an  sf-path  edge-packing  using  this  maximum  number  of  such  paths  does  not 
necessarily  give  a  good  lower  bound.  Given  a  choice  between  using  two  long  paths  and 
one  short  path  lor  vice  versa),  either  selection  might  be  more  reliable  than  the  other 
and  result  in  a  better  lower  bound.  .4  recent  article  of  Torrieri  [137]  presents  some 
algorithms  for  generating  disjoint  sf-paths.  Also  note  that  since  edge-disjoint  sf-paths 
are  necessarily  also  node-disjoint,  it  is  easy  to  include  node  failures  in  the  lower  bound 
estimates. 

To  obtain  an  upper  bound  we  instead  make  use  of  an  edge-packing  {,\i,  ....  x-„} 
consisting  of  disjoint  st-cutsets  [1381.  If  the  network  is  operating  in  such  a  way  that  t  is 
reachable  from  s,  then  each  cutset  must  fail  to  separate  t  from  s  (but  not  conversely). 
That  is.  each  cutset  \i,.... \„  must  contain  at  least  one  operating  link.  For  an  sf-cutset 
\  let  Q{  \ )  denote  the  probability  that  \  fails  to  disconnect  t  from  s.  Since  it  cannot  be 
true  that  every  edge  e  €  \  has  failed, 

e  €  \ 

.so  we  obtain  an  upper  bound  on  the  .st-reliability 

Rel(G;s.t)  <  n  Q{\,).  (6.29) 

k  =  1 

To  improve  this  bound  we  would  like  to 

(i)  find  an  additional  sf-cvitset  \„  +  ,  edge-disjoint  from 

(ii)  replace  one  of  the  ,st-cutsets  by  another  which  has  a  smaller  value 
of  Q  but  is  still  edge-disjoint  from  the  others. 
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Again,  these  two  goals  are  in  opposition.  Also,  there  is  an  elementary  analogue  of 
Monger's  theorem  that  holds  in  this  situation  [3]: 

The  maximum  number  of  edge-disjoint  sf-cutsets  is  equal 
to  dist(s.f),  the  length  of  the  shortest  path  from  s  to  t 

As  with  the  sf-path  edge-packing,  it  is  not  clear  how  one  should  go  about  optimizing  the 
resulting  upper  bounds,  although  it  is  clear  that  only  minimal  st-cutsets  should  be 
considered  as  candidates  for  the  edge-packing. 

It  is  clear  that  for  any  network  there  is  an  optimal  edge-packing  of  the  desired  type 
which  gives  that  best  corresponding  bound  on  the  st-reliability,  but  the  optimum  will  in 
general  depend  on  the  edge-probabilities.  Xo  efficient  algorithms  are  known  for  doing 
this,  but  it  is  also  not  established  that  the  problem  is  iVP-hard.  In  any  case  these 
procedures  give  a  quick  method  of  finding  some  bounds  on  Rel  {G;s,t).  It  is  not  clear 
how  good  these  bounds  are  in  general,  but  in  test  cases  they  are  competitive  with  other 
algorithms,  and  they  do  apply  to  networks  with  arbitrary  edge  probabilities.  The 
methods  of  Torrieri  [137]  and  Wagner  [13S]  can  be  used  to  generate  edge-packings  of  the 
two  types  described  here. 

However,  it  is  important  to  note  that  even  the  optimal  bounds  cannot  be  uniformly 
good  over  all  possible  network  inputs,  for  we  have  the  following  [43.120]: 


Given  a  probabilistic  network  G.  distinct  nodes  s.  t  G  V’(G), 
and  a  number  e  >  0.  find  r  such  that  ]  r  -  Rel  (G:s.t)  ]  <  e. 
This  pioblem  is  .VP-hard,  even  for  constant  edge  probabilities. 


At  this  point,  one  might  ask  if  the  expected  worst-case  exponentied  growth  in  the 
calculation  of  network  reliability  is  typical  behavior.  For  excimple.  the  well-known 
simplex  algorithm  in  linear  programming  does  require  an  exponential  runtime  for  some 
input  data  sets,  but  such  examples  are  somewhat  contrived.  For  real-world  applications 
the  simplex  performance  proves  to  be  quite  satisfactory.  Unfortunately,  this  is 
definitely  not  the  case  for  the  graph  reliability  algorithms.  Even  for  very  simple 
network  configurations,  such  as  rectangular  grid  networks,  the  rapid  growth  of  the  run 
time  is  obvious.  (However,  the  linear  programming  problem  is  now  known  not  to  be 
.VP-hard.  It  does  admit  solution  by  another  algorithm  which  does  have  polynomial 
complexity. ) 
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Finally,  a  number  of  papers  [139-145]  have  dealt  with  Monte  Carlo  methods  in 
simulating  the  operation  of  the  network  so  as  to  obtain  approximate  values  of  the 
network  reliability.  W’e  will  not  go  into  details  here,  but  in  principle  this  is  simple  to 
do.  .Assuming  statistical  independence  with  given  edge  probabilities  P(e),  e  G  E{G),  we 
draw  a  sample  of  the  state  of  the  network,  say  X  =  (Xj,. . .Yq)  with  .Y,  true  or  false 
according  to  whether  the  ;-th  edge  is  operating  or  not.  Then  either  one  of  the  two 
graph  traversals  described  earlier  (DFS  or  BFS)  can  quickly  determine  if  s  and  t  are 
connected  in  the  graph  given  by  the  configuration  X,  so  t  would  be  reachable  from  s 
with  probability  P(.Y)  if  this  is  the  case.  .Averaging  this  value  over  an  ever  larger 
number  of  random  samples  of  the  state  of  the  network  gives  increasingly  accurate 
approximations  to  the  reliability  Rel((?;s. #),  and  the  variance  of  these  approximations 
can  be  estimated  so  as  to  monitor  convergence  as  the  sample  size  increases.  Such 
convergence,  however,  will  be  very  slow. 
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7.  OTHER  PERFORMANCE  MEASURES 

The  term  "‘reliability”  has  generally  been  reserved  to  refer  only  to  the  probability 
of  the  network  being  connected  with  respect  to  some  subset  of  edges.  Other  measures  of 
network  performance  can  be  used  to  quantify  the  susceptibility  of  the  network  to 
component  failures,  but  such  terms  as  “vulnerability”  or  "‘survivability”  have  no 
generally  accepted  mathematical  definitions.  In  this  section  we  introduce  several  other 
graph  invariants  that  have  application  to  the  general  question  of  network  vulnerability. 
While  these  may  not  have  the  same  intuitive  appeal  as  that  associated  with  st-path 
connectivity,  some  are  easier  to  calculate  and  are  more  appropriate  to  special  types  of 
networks  than  the  reliability  measure. 

We  first  discuss  some  quantities  relating  to  somewhat  different  aspects  of  graph 
connectivity  as  defined  earlier,  and  in  later  sections  consider  other  graph  invariants 
which  are  not  immediately  related  to  connectivity.  Most  of  these,  however,  are 
motivated  by  an  attempt  to  quantify  the  effect  that  the  removal  of  a  set  of  nodes  or 
edges  will  have  on  the  network.  For  example,  knowing  that  X{G)  =  k  iov  a.  given 
connected  graph  G  tells  us  only  that  there  is  some  set  Eq  C  E{G)  of  k  edges  whose 
removal  will  disconnect  G.  This  says  nothing  about  how  badly  disrupted  the  network 
might  have  become.  Perhaps  only  one  vertex  has  been  isolated  by  these  deletions,  or 
the  network  could  have  been  bisected  into  two  pieces  of  nearly  equal  size.  One  would 
presumably  prefer  the  first  configuration  so  that  the  number  of  node  pairs  that  can  still 
communicate  is  as  large  as  possible,  but  the  value  of  the  edge-connectivity  alone  has 
nothing  to  say  about  this. 

There  are  a  number  of  general  schemes  to  quantify  network  survivability,  most  of 
which  give  the  change  in  some  specific  graph  invariant  as  a  function  of  the  size  of  the 
set  of  network  components  which  were  removed,  which  for  us  will  generally  mean  edge 
removal.  .Also,  we  will  restrict  ourselves  to  the  deterministic  eispects  of  these  quantities 
without  reference  to  probabilistic  questions  of  component  failures.  It  should  be 
emphasized  that  much  of  this  material  is  of  recent  origin,  so  the  extent  of  its 
application  and  usefulness  to  network  analysis  is  not  yet  established.  Finally,  the 
problem  of  finding  efficient  algorithms  for  the  calculation  of  the  invariants  in  question  is 
not  at  all  well  developed. 
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7.1  CONNECTIVITY  FACTORS 

If  a  graph  is  not  connected  it  is  normally  analyzed  one  component  at  a  time. 
Quantities  of  particular  interest  would  be  the  number  of  components  and  the  size 
(nodes  and  edges)  of  the  largest  and  smallest  components.  Depending  on  the  purpose  of 
the  communication  network  in  question  we  might  want  to  have  either  the  largest  or  the 
smallest  component  of  the  graph  to  have  as  many  nodes  or  edges  possible.  The  worst 
case  is  obviously  that  of  a  totally  disconnected  graph — every  node  is  isolated  and  there 
are  no  edges.  Some  measures  of  network  vulnerability  that  have  recently  been 
investigated  serve  as  indicators  of  how  susceptible  a  network  is  to  being  reduced  to  such 
a  configuration  [146-152]. 

We  shall  describe  here  the  node-connectivity  factor  (NCF),  which  represents  the 
average  number  of  nodes  that  should  be  removed  from  the  network  in  order  that  the 
remaining  subgraph  be  totally  disconnected.  This  quantity  is  best  defined  recursively. 
First,  if  G  is  disconnected  and  has  components  Gj,  ....  G^,  then  the  NCF  is  simply 
additive. 

NCF(G)  ^  £  NCF(6',).  (7.1) 

I  =  1 

For  G  a  connected  graph  let  k{G)  be  the  size  of  the  smallest  set  Vq  C  V(G)  for  which 
G  —  I'o  is  disconnected  and  let  \,.  ....  \„  be  the  collection  of  all  such  node  sets  Vo- 
Then 

NCF(G)^A-(G)+  i  £  ^^CF(G-x.)  (7.2) 

I  =  I 

Thus  the  NCF  of  a  connected  graph  G  is  defined  in  terms  of  its  value  on  some  of  the 
connected  subgraphs  of  G.  This  sets  up  a  recursion  which  stops  when  the  cutset 
removals  have  reduced  G  to  the  trivial  case,  a  single  isolated  node,  which  has  NCF 
equal  to  zero. 

The  recursive  calculation  of  this  quantity  results  in  a  so-called  decomposition 
diagram.,  which  is  just  the  recursion  tree  of  connected  subgraphs  formed  as  the  minimal 
cutnode  sets  are  deleted.  Unfortunately,  the  size  of  this  tree  grows  exponentially  with 
the  size  of  the  graph  and  it  is  po.ssible  to  compute  the  NCF  only  for  small  networks. 
Some  research  has  been  done  to  investigate  the  effect  of  terminating  the  recursive 
branching  prematurely  [148]  so  as  to  obtain  an  approximate  value  of  NCF(G).  .Another 
method  used  to  control  the  algorithm  is  to  stop  further  function  calls  when  connected 
subgraphs  of  known  type  are  generated  [149,  151],  These  would  be  special  graphs  (e.g.. 
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stars,  complete  graphs,  cycles)  whose  NCF’s  are  already  known  and  have  been  stored  in 
some  data  base  which  is  available  to  the  algorithm.  As  soon  as  one  of  these  graphs  is 
encountered  in  the  decomposition  diagram,  then  its  NCF  value  can  be  obtained  directly 
and  no  further  branching  to  disconnect  this  graph  is  necessary. 

It  is  also  possible  to  measure  the  importance  of  any  individual  node  to  the  value  of 
the  XCF  by  calculating  a  weighted  sum  of  the  number  of  terms  (i.e.,  connected 
subgraphs)  of  the  decomposition  diagram  in  which  that  node  occurs.  In  this  way  one 
has  some  measure  of  the  relative  contributions  of  the  various  v  €  F(G)  to  the  network 
connectivity.  This  information  could  be  used  to  re-allocate  the  network  links  so  as  to 
equalize  the  various  contributions  of  the  graph  vertices,  thereby  making  the  network 
less  vulnerable  to  node  failures  (that  is,  increasing  the  value  of  the  NCF). 

.4  similary  quantity,  the  link-connectivity  factor  (LCF),  has  also  been  investigated. 
This  uses  the  spanning  trees  of  the  components  of  G  and  is  a  measure  of  the  average 
contribution  of  the  network  links  to  maintaining  a  minimally  connected  configuration. 
For  a  connected  (p,  ^)-graph  G  it  is  defined  as 

LCF(G)^(^)r(G),  (7.3) 

where  T{G)  is  the  number  of  spanning  trees  of  G.  (We  remark  that  T{G)  can  be  given 
as  the  determinant  of  a  matrix,  analogous  to  the  adjacency  matrix,  constructed  directly 
from  the  list  of  edge  connections.  Hence  this  number,  although  large,  is  not 
prohibitively  expensive  to  compute  [20].)  If  G  is  not  connected  but  has  components 
G, . G„.  then  its  LCF  is  given  as  [142]. 

LCF(G)  =  i  y  5,  LCF(G,)  (7.4) 

■^1  =  1 

where  5,  is  the  number  of  edges  in  a  spanning  tree  of  G,,  z  =  1,  n  and  5  is  the 
numbei  of  edges  in  a  spanning  tree  for  G  (that  is,  one  obtained  by  interconnecting  the 
spanning  trees  of  the  G,).  It  is  easier  to  calculate  than  the  NCF,  as  it  requires  only  the 
total  number  of  spanning  trees  of  each  component  of  G,  not  the  actual  collection  of  all 
such  trees.  Similarly,  a  method  of  rating  the  relative  contributions  of  the  individual 
edges  to  the  LCF  can  be  defined,  ,4s  with  the  NCF,  network  vulnerability  would 
presumably  be  decreased  by  revising  the  network  links  so  as  to  spread  these 
contributions  more  evenly  throughout  the  network  and  increase  the  value  of  LCF(G). 

Typically  these  heuristics  are  used  in  the  problem  of  designing  or  synthesizing 
survivable  networks  or  improving  the  survivability  of  a  given  network.  That  is,  one  has 
at  least  some  control  over  how  the  network  is  to  be  configured,  or  some  freedom  in 
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rearranging  a  given  configuration.  The  NCF  and  LCF  values  then  can  indicate  the 
advantages  of  one  particular  resource  allocation  versus  another,  hopefully  leading  to  an 
optimal  arrangement.  The  result  would  be  a  network  in  which  no  one  part  of  the 
network  is  much  more  vulnerable  ‘o  component  failure  than  any  other  [147]. 

7.2  NETWORK  DIAMETER 

Given  a  graph  G.  the  diameter  of  G  is  the  maximum  hop  distance  supported  by  the 
edge  configuration. 

Diam(G)  =  Max{dist(u.  e):  u.  u  G  K(G)}.  (7.5) 

which  is  to  be  interpreted  as  +  oc  if  G  is  not  connected.  .A  number  of  recent  researches 
[153-166]  have  used  this  as  a  measure  of  connectivity,  a  graph  with  small  diameter  being 
considered  as  better-connected  than  one  with  a  larger  diameter.  This  allows  a  more 
quantitive  measure  of  connectivity.  For  example,  whereas  a  graph  is  connected  if  and 
only  if  Diam(G)  <  +  oc,  a  separating  set  of  edges  EqCE(G)  is  one  for  which 
Diam(G  —  Eq)  =  -f  oo.  Rather  than  looking  for  such  a  drastic  change,  we  may  be  more 
careful  and  ask  for  more  detail  about  the  difference  between  Diam(G)  and 
Diam(G  — Eq)  for  various  subsets  Eq  of  £(G).  Other  authors  have  used  the  average 
distance  between  nodes  instead  of  the  diameter  [155.  158.  160.  165]. 

-An  early  example  of  this  is  the  so-called  persistence  of  a  connected  graph,  which  is 
defined  to  be  the  size  of  the  smallest  set  EqC  E{G)  with  Diam  (G)  <  Diam(G  —  £„). 
That  is. 

Pers(G)  =  Min{  \  Eq\:EqC  E{G).  Diam(G)  <  Diam(G  -  Eq)}- 

.A  number  of  related  quantities  will  l)e  considered  in  this  section,  mainly  by  specifying 
the  size  of  the  set  of  nodes  or  edges  deleted  from  the  network.  However,  we  shall 
restrict  ourselves  primarily  to  edge  deletions.  This  is  because,  as  already  noted  earlier 
for  path  connectivity,  the  removal  of  a  set  of  nodes  from  a  graph  can  give  a  wide  variety 
of  distinctly  different  results.  .An  example  of  this  for  a  connected  graph  is  shown  in 
Figure  7.1.  In  this  example,  apart  from  from  the  case  where  the  graph  becomes 
disconnected,  node  removal  can  either  increase  or  decrease  the  diameter.  This  sort  of 
behavior  is  not  possible  for  edge  removal,  which  always  results  in  an  increase  in  the 
diameter  [163.  166].  That  is.  the  deletion  of  edges  can  never  improve  the  connectivity 
measure,  and  this  monotone  property  is  very  advantageous. 
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Diam(G-v)=4  Diam(G-w)  =  +  (» 

FIGURE  7.1  CHANGE  IN  DIAMETER  VIA  NODE  DELETION 


A  more  informative  quantity  Ccin  be  defined  under  the  general  concept  of  leverage, 
[153]  which  allows  a  very  broad  method  of  quantifying  changes  in  a  given  graph 
invariant  with  respect  to  the  number  of  network  components  under  consideration. 
Restricting  ourselves  to  the  diameter  measure  and  edge  deletions,  we  can  define  the 
leverage  (more  precisely,  the  leverage  sequences)  of  G  as 

I-(A:)4Max{Diam(G-Eo)-Diam(G)  :EoCF:(G),  |Eol  =  ^},  (7.7a) 
^-(it)^Min{Diam(G-Eo)-Diam(G)  :  EoCE(G),  |Eol  =k}.  (7.7b) 

The  negative  superscript  here  refers  to  the  fact  that  k  edges  of  the  graph  have  been 
deleted,  but  we  may  well  want  to  know  the  effect  of  adjoining  edges  to  the  network. 
This  leads  to  the  definitions 

L  +  {k)^  Max{Diam(G)  -  Diam(G  +  EJ:  ]  i  =  fc},  (7.8a) 

^  +  (/t)  =  Min{Diam(G)-Diam(G  +  Ei):  |Ei  I  =k}.  (7.8b) 

Here  the  maximum  and  minimum  are  over  all  sets  of  k  edges  which  are 
complementary  to  G.  (That  is.  we  consider  G  as  a  subgraph  of  and  add  edges  of  Kp 
which  are  not  in  E{G).  ) 
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The  leverage  sequences  have  a  lot  of  information  in  them  and  subsume  many  more 
elementary  graph-theoretic  quantities.  For  example,  the  persistence  is  simply 

Pers{G)  =  Mm{k  >  1:1  "(A-)  >  0}.  (7.9) 

Knowing  that  k  =  Pers(G)  tells  us  that  the  diameter  of  G  is  increased  by  the  removal  of 
some  set  of  k  edges,  but  it  does  not  indicate  the  magnitude  of  this  increase.  Another 
example  of  this  is  the  definition  of  a  critical  graph,  which  here  means  edge-critical  (with 
respect  to  the  diameter).  For  this  we  want  £‘'■(1)  >  0,  so  the  diameter  of  G  is  increased 
if  any  single  edge  of  G  is  removed. 

Definitions  such  as  leverage  are  sometimes  referred  to  as  second-order 
measurements  [153,  154],  meaning  that  the  originally  defined  diameter  is  a  first-order 
quantity  and  the  leverage  is  defined  in  terms  of  changes  in  that  term.  The  number  of 
possibilities  here  is  without  limit,  for  in  place  of  Diam(G)  we  could  substitute  any  other 
graph  invariant  (e.g.,  the  number  of  components,  minimal  or  maximal  degree,  etc).  For 
example  we  would  get  a  mixed  connectivity  measure  if  we  replaced  Diam(G)  with  the 
vertex-connectivity  k(G) — we  would  be  measuring  the  change  in  vertex-  nnectivity  as  a 
function  of  the  number  of  edges  being  added  or  removed. 

Research  into  the  availability  and  applicability  of  these  quantities  is  quite  recent, 
and  most  of  the  work  to  date  has  been  devoted  to  finding  upper  bounds  on  the  leverage 
and  determining  its  extremal  values  over  a  restricted  class  of  graphs.  We  cite  some  of 
these  results  to  illustrate  the  behavior  of  these  ideas. 

(i)  If  \{G)  >  2  then  L~{1)  <  Diam(G).  Otherwise  stated, 

Diam(G  —  e)  <  2  Diam(G)  for  all  edges  e  G  E{G). 

(ii)  If  A(G)  =  n  with  G  an  n-regular  graph,  then  L  “  (n  -  1)  >  0. 

(iii)  Of  all  the  n-regular  graphs  with  A(G)  =  n,  the  n-dimensional 
hypercube  Q„  has  the  most  resistance  to  diameter  increases 

by  the  removal  of  edges.  Its  leverage  sequence  is  (0.0.. ..,0,1.  -I-  oo), 
where  the  1  occurs  in  the  (n  —  l)th  position. 

With  respect  to  this  last  example,  we  note  that  the  u-dimensional  hypercube  Q„ 
can  be  defined  by  specifying  its  node  set  to  be  the  integers  {0,1.....2”  —  1}  and 
connecting  nodes  i.  and  j  whenever  the  binary  representations  of  i  and  j  differ  in  exactly 
one  binary  digit.  We  have  Diam(Q„)  =  n  and  no  removal  of  less  than  n  —  1  edges  will 
change  the  diameter.  It  is  possible  to  increase  the  diameter  to  n  -t-  1  by  removal  of 
some  set  of  n  -  1  edges,  and  to  disconnect  Q„  by  removing  some  other  set  of  n  -|- 1 
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edges.  Thus  the  graph  is  an  extremal  graph  for  this  leverage  sequence  in  the  class  of 
all  rj-regular  graphs  with  edge-connectivity  equal  to  n. 

This  is  the  type  of  behavior  a  network  should  have  in  order  to  be  relatively 
invulnerable  to  link  failures,  but  the  class  of  graphs  referred  to  here  is  not  appropriate 
for  mobile  communication  networks.  More  research  needs  to  be  done  in  applying  these 
concepts  to  communication  networks,  and  for  many  of  these  quantities  no  efficient 
algorithms  are  as  yet  available  for  their  calculation. 

7.3  MEASURES  OF  VULNERABILITY 

Several  other  quantities  of  relevence  to  the  vulnerability  of  a  network  can  be 
defined  in  terms  of  graph  connectivity  [167-179].  Given  t  >  0,  a  connected  graph  G  is 
said  to  be  t-tough  if  for  all  Vq  C  V{G)  for  which  G  —  Uq  is  disconnected  we  have 

t<\V^\/ciG-Vo).  (7.10) 

The  toughness  t{G)  is  given  as  [3] 

t{G)  ^  Max{f  >  0  :  G  is  t-tough}.  (7.11) 

This  graph  invariant  has  been  used  in  researches  on  Hamiltonian  cycles  and  to  obtain 
lower  bounds  on  the  circumference  (the  length  of  the  longest  cycle)  of  a  graph.  As 
defined  above  the  toughness  depends  on  the  removal  of  a  set  of  vertices  of  G,  but  an 
obvious  analogue  with  edge  deletions  gives  meaning  to  the  edge-toughness  of  a  graph. 

.Another  quantity  of  interest  refers  to  the  quantity  m(G),  the  number  of  nodes  in 
the  largest  component  of  G.  Using  this  we  can  consider  the  so-called  integrity  [146].  For 
Ug  C  U(G)  we  define 

Int(V^o)^|  Ug  I  +  m(G-Ug),  (7.12) 

and  the  integrity  of  the  graph  is 

Int(G)  ^  Min{Int  (Ug):  Ug  C  V{G)}.  (7.13) 

(Strictly  speaking  we  should  refer  to  this  as  the  vertex-integrity  of  G,  with  a 
corresponding  edge-integrity  given  by  deleting  edges  of  G  instead  of  vertices.) 
Minimizing  Int(Ug)  over  Ug  C  V'^(G)  involves  a  trade-off  between  forcing  jUgj  to  be  small 
against  making  m(G-Ug)  small.  Thus  networks  become  less  vulnerable  to  component 
failures  as  the  integrity  measure  becomes  larger,  for  if  G'  is  a  subgraph  of  G  then  it  is 
always  true  that  IntfG')  <  Int(G). 
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Finally,  the  very  definition  of  graph  connectivity  can  be  made  relative  to  other 
graph- theoretic  conditions.  That  is,  if  p  is  some  graph  property  satisfied  by  a  connected 
graph  G,  we  could  require  that  any  disconnection  of  G  result  only  in  connected 
components  that  also  satisfied  property  p  [162,  164]. 

Measures  of  graph  vulnerability  can  also  be  given  directly  in  terms  of  the  vertex-to- 
vertex  connections  specified  by  the  edges  of  the  graph.  A  set  Vq  CV{G)  is  said  to  be  a 
dominating  set  if  every  v  G  V{G)  —  Fq  is  adjacent  to  some  vertex  Vq  €  V q.  A  dominating 
set  of  smallest  cardinality  is  called  a  minimum  dominating  set  and  its  cardinality, 
denoted  by  cr(G),  is  the  domination  number  of  G  [4,  13.  20].  For  example,  if  some  of 
the  nodes  in  a  communication  network  had  only  a  receiving  capability  (they  cannot 
transmit,  hence  cannot  act  as  relays),  then  the  transmitting  nodes  should  be  a 
dominating  set — the  receiver  nodes  must  have  direct  communication  links  to  the 
transmitting  vertices.  We  observe  that  if  G'  is  a  spanning  subgraph  of  G,  then 
(t{G')  =  <t{G)  and  that  consequently  a{G  -  Eq)  >  a{G)  for  every  set  of  edges  Eq  C  E{G). 

The  domination  number  itself  is  not  a  good  measure  of  vulnerability,  for  both  the 
complete  graph  A'p  and  any  star  have  <t(G)  =  1.  Of  more  interest  [171]  is  the  bondage 
number  b(G),  which  is  the  smallest  number  of  edges  whose  removal  increases  the 
domination  number,  or 

6(G)  =  Min{l  Eol:EoCE(Gh  or(G)  <  a(G  -  E^)}  (7.14) 

(This  could  also  be  defined  in  terms  of  a  leverage  measured  with  respect  to  the 
domination  number.)  Thus  the  deletion  of  some  set  of  b(G)  edges  will  destroy  the 
domination  properly  of  every  minimum  dominating  set  of  G,  requiring  that  at  least  one 
more  transmitter  would  be  necessary  to  restore  full  communication. 

Upper  bounds  on  the  bondage  number  can  be  given  in  terms  of  several  other  graph 
invariants,  including  the  domination  number  <t(G).  If  we  assume  that  G  is  connected 
and  non-trivial.  then  the  following  results  can  be  found  in  the  literature: 

(i)  6(A'p)  =  rp/2l 

(ii)  b{G)<p-l 

(iii)  6(G)  <  p+l  -  (r{G) 

(iv)  If  cr(G)  >  2.  then  6(G)  <  A(G)  [^’■(G)  —  1]  -|-  1 

(v)  6(G)  <  Min{deg(n)  +  deg(e)  -  1:  u.v  €  F(G),  adjacent} 

For  good  survivability.  6(G)  should  be  large,  so  trees  and  paths  are  most  vulnerable 
since  their  bondage  number  is  small.  In  fact,  for  any  tree  T  we  have  b{T)  <  2.  with 
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h{T)  =  1  if  any  v  G  V[T)  is  adjacent  to  two  or  more  pendant  nodes. 

There  are  many  variations  on  the  definition  of  dominating  sets.  For  example, 
given  I'o  C  V{G)  and  any  integer  k  >  1,  we  could  require  that  each  v  G  V{G)  —  Vq  be 
adjacent  to  at  least  k  vertices  of  Vq-  Alternatively,  we  could  ask  that  each 
V  €  V{G)  —  V^o  be  within  hop  distance  at  most  k  to  some  Cq  G  in  both  of  these 

situations  the  case  k  =  I  is  the  usual  concept  of  domination.  A  slightly  stronger 
constraint  is  that  these  conditions  hold  for  aJl  u  G  V{G),  not  merely  those  in  F(G)  —  Vq. 
Also.  one  may  consider  the  analogous  concept  of  edge-domination,  where  Eq  C  E{G)  is 
said  to  dominate  G  if  for  every  edge  e  G  E(G)  —  Eq  there  exists  Cq  G  Eq  so  that  e  and  Cg 
are  incident  to  some  common  node  Vq  G 

.A  slightly  different  idea  is  that  of  a  graph  covering.  Here  V^g  ^  ^(G)  is  said  to  be  a 
covering  set  of  G  if  each  e  £  E{G)  is  incident  to  some  node  t>g  G  Vq.  The  covering 
number.  Cov(G),  of  G  is  then  the  cardinality  of  the  smallest  covering  set  of  G. 
Similarly,  Eq  C  E{G)  is  an  edge  cover  of  G  if  every  v  G  V{G)  is  incident  to  some  edge 
eg  G  Eq.  Note  the  duality  between  dominating  sets  and  covering  sets  in  that  nodes 
dominate  other  nodes  of  G  but  cover  edges.  Similarly,  edges  dominate  other  edges  but 
cover  nodes  of  the  graph  [21]. 

.4  complementary  concept  is  that  of  independence  in  a  graph,  which  requires  non¬ 
adjacency  where  domination  insists  on  adjacency.  Thus  Vq  C  V{G)  is  an  independent 
set  of  nodes  if  no  two  elements  of  Vg  are  adjacent,  and  Eq  G  E{G)  is  an  independent 
edge  set  if  no  two  edges  in  Eq  are  incident  to  a  common  vertex  Vq  G  V{G].  (An 
independent  edge  set  is  also  called  a  matching  of  G.)  The  node-  (resp.  edge-) 
independence  number.  Ind(G),  is  then  the  cardinality  of  the  largest  set  of  independent 
nodes  (resp.  edges)  in  G.  Clearly,  the  complement  of  a  node  cover  of  G  is  an 
independent  set,  and  conversely. 

Numerous  relationships  hold  among  these  extreme  values.  If  we  use  the  subscripts 
e  and  v  to  distinguish  between  sets  of  edges  and  nodes  of  G,  respectively,  then  the 
following  statements  are  true  [22]: 


DomjG)  <  Ind,(G)  <Cov,.(G). 

(7.15a) 

Dom^.jG)  <  Indy(G)  <  CoVp(G). 

(7.15b) 

Ind,(G)  +  CovJG)  =  p. 

(7.15c) 

Ind,,(G)  -f  Cov^,(G)  =  p. 

(7.15d) 
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Also,  as  in  the  case  of  dominating  sets,  the  notions  of  covering  sets  and 
independent  sets  can  be  further  extended  and  quantified,  and  leverage  sequences  can  be 
defined  in  terms  of  each  one.  These  invariants  have  been  used  in  applications  to 
network  scheduling  problems,  particularly  those  involved  with  paxallel  data  processing. 
While  they  all  have  some  relevence  to  network  vulnerability  in  the  broad  sense,  their 
more  direct  bearing  on  vulnerability  and  survivability  requires  much  more  investigation. 
Finally,  some  of  these  invariants  again  lead  to  computationally  intractable  problems. 
For  example,  given  a  graph  G  =  G{V,E)  and  an  integer  k  <  |  V{G)  |  ,  the  question  of 
deciding  whether  there  exists  a  vertex  covering  of  G  having  cardinality  at  most  k  is 
known  to  be  .VP-complete,  and  the  same  is  true  if  we  ask  for  an  independent  set  of  at 
most  k  vertices.  This  is  not  the  case  for  independent  edges,  however,  as  an  algorithm 
due  to  Edmonds  [4,20]  will  find  maximum  matchings  with  complexity  0(p‘*).  In 
addition,  some  of  the  corresponding  questions  for  stochastic  graphs  can  be  shown  to  be 
.VP-hard  [167]. 
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8.8  PARALLEL  GRAPH  ALGORITHMS 
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8.9  STATE  SPACE  ENUMERATION 
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8.10  INCLUSION-EXCLUSION  METHODS 
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8.17  GRAPH  DIAMETER  AND  CONNECTIVITY 
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8.18  OTHER  MEASURES  OF  VULNERABILITY 

The  remaining  references  are  concerned  with  a  number  of  other  graph  invariants 
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